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Diagnostics and Therapeutics for Diseases 
Associated with an IL-1 Inflammatory Haplotype 



1. Background of the Invention 



Genetics of the IL-1 Gene Cluster 

10 The IL-1 gene cluster is on the long arm of chromosome 2 (2ql3) and contains at 

least the genes for IL-la (IL-1A), IL-ip (IL-1B), and the IL-1 receptor antagonist (IL-1RN), 
within a region of 430 Kb (Nicklin, et al. (1994) Genomics, 19: 382-4). The agonist molecules, 
IL-la and IL-lp, have potent pro-inflammatory activity and are at the head of many 
inflammatory cascades. Their actions, often via the induction of other cytokines such as IL-6 and 

15 IL-8, lead to activation and recruitment of leukocytes into damaged tissue, local production of 
vasoactive agents, fever response in the brain and hepatic acute phase response. All three IL-1 
molecules bind to type I and to type II IL-1 receptors, but only the type I receptor transduces a 
signal to the interior of the cell. In contrast, the type II receptor is shed from the cell membrane 
and acts as a decoy receptor. The receptor antagonist and the type II receptor, therefore, are both 

20 anti-inflammatory in their actions. 

Inappropriate production of IL-1 plays a central role in the pathology of many 
autoimmune and inflammatory diseases, including rheumatoid arthritis, inflammatory bowel 
disorder, psoriasis, and the like. In addition, there are stable inter-individual differences in the 
rates of production of IL-1, and some of this variation may be accounted for by genetic 

25 differences at IL-1 gene loci. Thus, the IL-1 genes are reasonable candidates for determining 
part of the genetic susceptibility to inflammatory diseases, most of which have a multifactorial 
etiology with a polygenic component. 

Certain alleles from the IL-1 gene cluster are known to be associated with 
particular disease states. For example, IL-1RN (VNTR) allele 2 has been shown to be 

30 associated with osteoporosis (U.S. Patent No. 5,698,399), nephropathy in diabetes mellitus 
(Blakemore, et al. (1996) Hum. Genet. 97(3): 369-74), alopecia areata (Cork, et al, (1995) J. 
Invest. Dermatol. 104(5 Supp.): 15S-16S; Cork et al. (1996) Dermatol Clin 14: 671-8), Graves 
disease (Blakemore, et al. (1995) J. Clin. Endocrinol. 80(1): 111-5), systemic lupus 
erythematosus (Blakemore, et al. (1994) Arthritis Rheum. 37: 1380-85), lichen sclerosis (Clay, 
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et al. (1994) Hum. Genet. 94: 407-10), and ulcerative colitis (Mansfield, et al. (1994) 
Gastoenterol. 106(3): 637-42)). 

In addition, the IL-1A allele 2 from marker -889 and IL-1B (TaqI) allele 2 from 
marker +3954 have been found to be associated with periodontal disease (U.S. Patent No. 

5 5,686,246; Kornman and diGiovine (1998) Ann Periodont 3: 327-38; Hart and Kornman (1997) 
Periodontol 2000 14: 202-15; Newman (1997) Compend Contin Educ Dent 18: 881-4; Kornman 
et al. (1997) J. Clin Periodontol 24: 72-77). The IL-1A allele 2 from marker -889 has also been 
found to be associated with juvenile chronic arthritis, particularly chronic iridocyclitis 
(McDowell, et al. (1995) Arthritis Rheum. 38: 221-28 ). The IL-1B (TaqI) allele 2 from marker 

10 +3954 of IL-1B has also been found to be associated with psoriasis and insulin dependent 

diabetes in DR3/4 patients (di Giovine, et al. (1995) Cytokine 7: 606; Pociot, et al. (1992) Eur J. 
Clin. Invest. 22: 396-402). Additionally, the IL-1RN (VNTR) allele 1 has been found to be 
associated with diabetic retinopathy (see USSN 09/037472, and PCT/GB97/02790). 
Furthermore allele 2 of IL-1RN (VNTR) has been found to be associated with ulcerative colitis 

15 in Caucasian populations from North America and Europe (Mansfield, J. et al., (1994) 
Gastroenterology 106: 637-42). Interestingly, this association is particularly strong within 
populations of ethnically related Ashkenazi Jews (PCT WQ97/25445). 



Genotype Screening 

20 Traditional methods for the screening of heritable diseases have depended on 

either the identification of abnormal gene products {e.g., sickle cell anemia) or an abnormal 
phenotype {e.g., mental retardation). These methods are of limited utility for heritable diseases 
with late onset and no easily identifiable phenotypes such as, for example, vascular disease. 
With the development of simple and inexpensive genetic screening methodology, it is now 

25 possible to identify polymorphisms that indicate a propensity to develop disease, even when the 
disease is of polygenic origin. The number of diseases that can be screened by molecular 
biological methods continues to grow with increased understanding of the genetic basis of 

multifactorial disorders. 

Genetic screening (also called genotyping or molecular screening), can be broadly 
30 defined as testing to determine if a patient has mutations (alleles or polymorphisms) that either 
cause a disease state or are "linked" to the mutation causing a disease state. Linkage refers to the 
phenomenon th DNA sequences which are close together in the genome have a tendency to be 
inherited together. Two sequences may be linked because of some selective advantage of co- 
inheritance. More typically, however, two polymorphic sequences are co-inherited because of 
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the relative infrequency with which meiotic recombination events occur within the region 
between the two polymorphisms. The co-inherited polymorphic alleles are said to be in linkage 
disequilibrium with one another because, in a given human population, they tend to either both 
occur together or else not occur at all in any particular member of the population. Indeed, where 

5 multiple polymorphisms in a given chromosomal region are found to be in linkage 

disequilibrium with one another, they define a quasi-stable genetic "haplotype." In contrast, 
recombination events occurring between two polymorphic loci cause them to become separated 
onto distinct homologous chromosomes. If meiotic recombination between two physically 
linked polymorphisms occurs frequently enough, the two polymorphisms will appear to 

10 segregate independently and are said to be in linkage equilibrium. 

While the frequency of meiotic recombination between two markers is generally 
proportional to the physical distance between them on the chromosome, the occurrence of "hot 
spots" as well as regions of repressed chromosomal recombination can result in discrepancies 
between the physical and recombinational distance between two markers. Thus, in certain 

15 chromosomal regions, multiple polymorphic loci spanning a broad chromosomal domain may 
be in linkage disequilibrium with one another, and thereby define a broad-spanning genetic 
haplotype. Furthermore, where a disease-causing mutation is found within or in linkage with 
this haplotype, one or more polymorphic alleles of the haplotype can be used as a diagnostic or 
prognostic indicator of the likelihood of developing the disease. This association between 

20 otherwise benign polymorphisms and a disease-causing polymorphism occurs if the disease 
mutation arose in the recent past, so that sufficient time has not elapsed for equilibrium to be 
achieved through recombination events. Therefore identification of a human haplotype which 
spans or is linked to a disease-causing mutational change, serves as a predictive measure of an 
individual's likelihood of having inherited that disease-causing mutation. Importantly, such 

25 prognostic or diagnostic procedures can be utilized without necessitating the identification and 
isolation of the actual disease-causing lesion. This is significant because the precise 
determination of the molecular defect involved in a disease process can be difficult and 
laborious, especially in the case of multifactorial diseases such as inflammatory disorders. 

Indeed, the statistical correlation between an inflammatory disorder and an IL-1 

30 polymorphism does not necessarily indicate that the polymorphism directly causes the disorder. 
Rather the correlated polymorphism may be a benign allelic variant which is linked to (i.e. in 
linkage disequilibrium with) a disorder-causing mutation which has occurred in the recent human 
evolutionary past, so that sufficient time has not elapsed for equilibrium to be achieved through 
recombination events in the intervening chromosomal segment. Thus, for the purposes of 
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diagnostic and prognostic assays for a particular disease, detection of a polymorphic allele 
associated with that disease can be utilized without consideration of whether the polymorphism 
is directly involved in the etiology of the disease. Furthermore, where a given benign 
polymorphic locus is in linkage disequilibrium with an apparent disease-causing polymorphic 

5 locus, still other polymorphic loci which are in linkage disequilibrium with the benign 
polymorphic locus are also likely to be in linkage disequilibrium with the disease-causing 
polymorphic locus. Thus these other polymorphic loci will also be prognostic or diagnostic of 
the likelihood of having inherited the disease-causing polymorphic locus. Indeed, a broad- 
spanning human haplotype (describing the typical pattern of co-inheritance of alleles of a set of 

10 linked polymorphic markers) can be targeted for diagnostic purposes once an association has 
been drawn between a particular disease or condition and a corresponding human haplotype. 
Thus, the determination of an individual's likelihood for developing a particular disease of 
condition can be made by characterizing one or more disease-associated polymorphic alleles (or 
even one or more disease-associated haplotypes) without necessarily determining or 

15 characterizing the causative genetic variation. 

2. Summary of the Invention 

In one aspect, the present invention provides novel methods and kits for 
determining whether a subject has or is predisposed to developing a disease or condition that is 

20 associated with an IL-1 polymorphism. In one embodiment, the method comprises determining 
whether the subject's nucleic acids contain a marker or allele comprising an IL-1 inflammatory 
haplotype. In a preferred embodiment, the IL-1 inflammatory haplotype is indicative of 
increased II- 1 agonist (e.g. IL-1 (441 12332)). In another preferred embodiment, the IL-1 
inflammatory haplotype is indicative of decreased IL-1 receptor antagonist activity (e.g. IL-1 

25 (33441461)). 

An allele comprising an IL-1 inflammatory haplotype can be detected by any of a 
variety of available techniques, including: 1) performing a hybridization reaction between a 
nucleic acid sample and a probe that is capable of hybridizing to the allele; 2) sequencing at least 
a portion of the allele; or 3) determining the electrophoretic mobility of the allele or fragments 
30 thereof (e.g., fragments generated by endonuclease digestion). The allele can optionally be 
subjected to an amplification step prior to performance of the detection step. Preferred 
amplification methods are selected from the group consisting of: the polymerase chain reaction 
(PCR), the ligase chain reaction (LCR), strand displacement amplification (SDA), cloning, and 
variations of the above (e.g. RT-PCR and allele specific amplification). Oligonucleotides 
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necessary for amplification may be selected, for example, from within the IL-1 gene loci, either 
flanking the marker of interest (as required for PCR amplification) or directly overlapping the 
marker (as in ASO hybridization). In a particularly preferred embodiment, the sample is 
hybridized with a set of primers, which hybridize 5' and 3 r in a sense or antisense sequence to the 

5 vascular disease associated allele, and is subjected to a PCR amplification. 

An allele comprising an IL-1 inflammatory haplotype may also be detected 
indirectly, e.g. by analyzing the protein product encoded by the DNA. For example, where the 
marker in question results in the translation of a mutant protein, the protein can be detected by 
any of a variety of protein detection methods. Such methods include immunodetection and 

10 biochemical tests, such as size fractionation, where the protein has a change in apparent 
molecular weight either through truncation, elongation, altered folding or altered post- 
translational modifications. 

In another aspect, the invention features kits for performing the above-described 
assays. The kit can include a nucleic acid sample collection means and a means for determining 

15 whether a subject carries at least one allele comprising an IL-1 inflammatory haplotype. The kit 
may also contain a control sample either positive or negative or a standard and/or an algorithmic 
device for assessing the results and additional reagents and components including: DNA 
amplification reagents, DNA polymerase, nucleic acid amplification reagents, restrictive 
enzymes, buffers, a nucleic acid sampling device, DNA purification device, deoxynucleotides, 

20 oligonucleotides (e.g. probes and primers) etc.. 

As described above, the control may be a positive or negative control. Further, 
the control sample may contain the positive (or negative) products of the allele detection 
technique employed. For example, where the allele detection technique is PCR amplification, 
followed by size fractionation, the control sample may comprise DNA fragments of the 

25 appropriate size. Likewise, where the allele detection technique involves detection of a mutated 
protein, the control sample may comprise a sample of mutated protein. However, it is preferred 
that the control sample comprises the material to be tested. For example, the controls may be a 
sample of genomic DNA or a cloned portion of the IL-1 gene cluster. Preferably, however, the 
control sample is a highly purified sample of genomic DNA where the sample to be tested is 

30 genomic DNA. 

The oligonucleotides present in said kit may be used for amplification of the 
region of interest or for direct allele specific oligonucleotide (ASO) hybridization to the markers 
in question. Thus, the oligonucleotides may either flank the marker of interest (as required for 
PCR amplification) or directly overlap the marker (as in ASO hybridization). 
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Information obtained using the assays and kits described herein (alone or in 
conjunction with information on another genetic defect or environmental factor, which 
contributes to the disease or condition that is associated with an IL-1 inflammatory haplotype) is 
useful for determining whether a non-symptomatic subject has or is likely to develop the 

5 particular disease or condition. In addition, the information can allow a more customized 
approach to preventing the onset or progression of the disease or condition. For example, this 
information can enable a clinician to more effectively prescribe a therapy that will address the 
molecular basis of the disease or condition. 

In yet a further aspect, the invention features methods for treating or preventing 

10 the development of a disease or condition that is associated with an IL-1 inflammatory haplotype 
in a subject by administering to the subject an appropriate therapeutic of the invention. In still 
another aspect, the invention provides in vitro or in vivo assays for screening test compounds to 
identify therapeutics for treating or preventing the development of a disease or condition that is 
associated with an IL-1 inflammatory haplotype. In one embodiment, the assay comprises — - 

15 contacting a cell transfected with a causative mutation that is operably linked to an appropriate 
promoter with a test compound and determining the level of expression of a protein in the cell in 
the presence and in the absence of the test compound. In a preferred embodiment, the causative 
mutation results in decreased production of IL-1 receptor antagonist, and increased production of 
the IL-1 receptor antagonist in the presence of the test compound indicates that the compound is 

20 an agonist of IL-1 receptor antagonist activity. In another preferred embodiment, the causative 
mutation results in increased production of IL-1 a or IL-lp , and decreased production of IL-1 a 
or IL-lp in the presence of the test compound indicates that the compound is an antagonist of IL- 
1 a or IL-lp activity. In another embodiment, the invention features transgenic non-human 
animals and their use in identifying antagonists of IL-la or IL-lp activity or agonists of IL-IRa 

25 activity. 

Other embodiments and advantages of the invention are set forth in the following 
detailed description and claims. 

3. Brief Description of the Figures 

30 

FIG. 1 is a schematic depiction of the IL-1 gene cluster including a few 
polymorphic markers. 
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FIG. 



2 is a graph which plots the correlation between disequilibrium values and 
physical distance as described herein. 



FIG. 3 shows the nucleic acid sequence for IL-1A (GEN X03833; SEQ ID No. 1). 
FIG. 4 shows the nucleic acid sequence for IL-1B (GEN X04500; SEQ ID No. 2). 
FIG. 5 shows the nucleic acid sequence for the secreted IL-1RN (GEN X64532; 
SEQ ID No. 3). 

4. Detailed Description of the Invention 

4.1 Definitions 

For convenience, the meaning of certain terms and phrases employed in the 
specification, examples, and appended claims is provided below. 

The term "allele" refers to the different sequence variants found at different 
polymorphic regions. For example, IL-1RN (VNTR) has at least five different alleles. The 
sequence variants may be single or multiple base changes, including without limitation 
insertions, deletions, or substitutions, or may be a variable number of sequence repeats. 

The term "allelic pattern" refers to the identity of an allele or alleles at one or 
more polymorphic regions. For example, an allelic pattern may consist of a single allele at a 
polymorphic site, as for IL-1RN (VNTR) allele 1, which is an allelic pattern having at least one 
copy of IL-1RN allele 1 at the VNTR of the IL-1RN gene loci. Alternatively, an allelic pattern 
may consist of either a homozygous or heterozygous state at a single polymorphic site. For 
example, IL1-RN (VNTR) allele 2,2 is an allelic pattern in which there are two copies of the 
second allele at the VNTR marker of IL-1RN that corresponds to the homozygous IL-RN 
(VNTR) allele 2 state. Alternatively, an allelic pattern may consist of the identity of alleles at 
more than one polymorphic site. 

The term "antibody " as used herein is intended to refer to a binding agent 
including a whole antibody or a binding fragment thereof which is specifically reactive with an 
IL-1 polypeptide. Antibodies can be fragmented using conventional techniques and the 
fragments screened for utility in the same manner as described above for whole antibodies. For 
example, F(ab)2 fragments can be generated by treating an antibody with pepsin. The resulting 
F(ab)2 fragment can be treated to reduce disulfide bridges to produce Fab fragments. The 
antibody of the present invention is further intended to include bispecific, single-chain, and 
chimeric and humanized molecules having affinity for an IL-1B polypeptide conferred by at least 
one CDR region of the antibody. 

"Biological activity" or "bioactivity" or "activity" or "biological function", which 
are used interchangeably, for the purposes herein means an effector or antigenic function that is 
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directly or indirectly performed by an IL-1 polypeptide (whether in its native or denatured 
conformation), or by any subsequence thereof Biological activities include binding to a target 
peptide, e.g., an IL-1 receptor. An IL-1 bioactivity can be modulated by directly affecting an IL- 
1 polypeptide. Alternatively, an IL-1 bioactivity can be modulated by modulating the level of an 

5 IL-1 polypeptide, such as by modulating expression of an IL-1 gene. 

As used herein the term "bioactive fragment of an IL-1 polypeptide" refers to a 
fragment of a full-length IL-1 polypeptide, wherein the fragment specifically mimics or 
antagonizes the activity of a wild-type IL-1 polypeptide. The bioactive fragment preferably is a 
fragment capable of interacting with an interleukin receptor. 

10 The term "an aberrant activity", as applied to an activity of a polypeptide such as 

IL-1, refers to an activity which differs from the activity of the wild-type or native polypeptide or 
which differs from the activity of the polypeptide in a healthy subject. An activity of a 
polypeptide can be aberrant because it is stronger than the activity of its native counterpart. 
Alternatively, an activity can be aberrant because it is weaker or absent relative to the activity of 

15 its native counterpart. An aberrant activity can also be a change in an activity. For example an 
aberrant polypeptide can interact with a different target peptide. A cell can have an aberrant IL-1 
activity due to overexpression or underexpression of an IL-1 locus gene encoding an IL-1 locus 
polypeptide. 

"Cells", "host cells" or "recombinant host cells" are terms used interchangeably 
20 herein to refer not only to the particular subject cell, but to the progeny or potential progeny of 
such a cell. Because certain modifications may occur in succeeding generations due to either 
mutation or environmental influences, such progeny may not, in fact be identical to the parent 
cell, but are still included within the scope of the term as used herein. 

A "chimera," "mosaic," "chimeric mammal" and the like, refers to a transgenic 
25 mammal with a knock-out or knock-in construct in at least some of its genome-containing cells. 

The terms "control" or "control sample" refer to any sample appropriate to the 
detection technique employed. The control sample may contain the products of the allele 
detection technique employed or the material to be tested. Further, the controls may be positive 
or negative controls. By way of example, where the allele detection technique is PCR 
30 amplification, followed by size fractionation, the control sample may comprise DNA fragments 
of an appropriate size. Likewise, where the allele detection technique involves detection of a 
mutated protein, the control sample may comprise a sample of a mutant protein. However, it is 
preferred that the control sample comprises the material to be tested. For example, the controls 
may be a sample of genomic DNA or a cloned portion of the IL-1 gene cluster. However, where 
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the sample to be tested is genomic DNA, the control sample is preferably a highly purified 

sample of genomic DNA. 

The phrase "diseases and conditions associated with IL-1 polymorphisms" refers 
to a variety of diseases or conditions, the susceptibility to which can be indicated in a subject 
5 based on the identification of one or more alleles within the IL-1 complex. Examples include: 
inflammatory or degenerative disease, including: Systemic Inflammatory Response (SIRS); 
Alzheimer's Disease (and associated conditions and symptoms including: chronic 
neuroinflammation, glial activation; increased microglia; neuritic plaque formation; and response 
to therapy); Amylotropic Lateral Sclerosis (ALS), arthritis (and associated conditions and 
10 symptoms including: acute joint inflammation, antigen-induced arthritis, arthritis associated with 
chronic lymphocytic thyroiditis, collagen-induced arthitis, juvenile chronic arthritis; juvenile 
rheumatoid arthritis, osteoarthritis, prognosis and streptococcus-induced arthritis), asthma (and 
associated conditions and symptoms, including: bronchial asthma; chronic obstructive airway 
disease; chronic obstructive pulmonary disease, juvenile asthma and occupational asthma); 
15 cardiovascular diseases (and associated conditions and symptoms, including atherosclerosis; 
autoimmune myocarditis, chronic cardiac hypoxia, congestive heart failure, coronary artery 
disease, cardiomyopathy and cardiac cell dysfunction, including: aortic smooth muscle cell 
activation; cardiac cell apoptosis; and immunomodulation of cardiac cell function; diabetes and 
associated conditions and symptoms, including autoimmune diabetes, insulin-dependent (Type 
20 1) diabetes, diabetic periodontitis, diabetic retinopathy, and diabetic nephropathy); 

gastrointestinal inflammations (and related conditions and symptoms, including celiac disease, 
associated osteopenia, chronic colitis, Crohn's disease, inflammatory bowel disease and 
ulcerative colitis); gastric ulcers; hepatic inflammations, cholesterol gallstones and hepatic 
fibrosis, HIV infection (and associated conditions and symptoms, including degenerative 
25 responses, neurodegenerative responses, and HIV associated Hodgkin's Disease), Kawasaki's 
Syndrome (and associated diseases and conditions, including mucocutaneous lymph node 
syndrome, cervical lymphadenopathy, coronary artery lesions, edema, fever, increased 
leukocytes, mild anemia, skin peeling, rash, conjunctiva redness, thrombocytosis; multiple 
sclerosis, nephropathies (and associated diseases and conditions, including diabetic nephropathy, 
30 endstage renal disease, glomerulonephritis, Goodpasture's syndrome, hemodialysis survival and 
renal ischemic reperfusion injury), neurodegenerative diseases (and associated diseases and 
conditions, including acute neurodegeneration, induction of IL-1 in aging and neurodegenerative 
disease, IL-1 induced plasticity of hypothalamic neurons and chronic stress 
hyperresponsiveness), Qphthalmopathies (and associated diseases and conditions, including 
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diabetic retinopathy, Graves' Ophthalmopathy, and uveitis, osteoporosis (and associated diseases 
and conditions, including alveolar, femoral, radial, vertebral or wrist bone loss or fracture 
incidence, postmenopausal bone loss, mass, fracture incidence or rate of bone loss), otitis media 
(adult or pediatric), pancreatis or pancreatic acinitis, periodontal disease (and associated diseases 

5 and conditions, including adult, early onset and diabetic); pulmonary diseases, including chronic 
lung disease, chronic sinusitis, hyaline membrane disease, hypoxia and pulmonary disease in 
SIDS; restenosis; rheumatism including rheumatoid arthritis , rheumatic aschoff bodies, 
rheumatic diseases and rheumatic myocarditis; thyroiditis including chronic lymphocytic 
thyroiditis;urinary tract infections including chronic prostatitis, chronic pelvic pain syndrome 

10 and urolithiasis. Immunological disorders, including autoimmune diseases, such as alopecia 
aerata, autoimmune myocarditis, Graves* disease, Graves ophthalmopathy, lichen sclerosis, 
multiple sclerosis, psoriasis, systemic lupus erythematosus, systemic sclerosis, thyroid diseases 
(e.g.goiter and struma lymphomatosa (Hashimoto's thyroiditis, lymphadenoid goiter), sleep 
disorders and chronic fatigue syndrome and obesity (non-diabetic or associated with diabetes). 

15 Resistance to infectious diseases, such as Leishmaniasis, Leprosy, Lyme Disease, Lyme Carditis, 
malaria, cerebral malaria, meningititis, tubulointestitial nephritis associated with malaria), which 
are caused by bacteria, viruses (e.g. cytomegalovirus, encephalitis, Epstein-Barr Virus, Human 
Immunodeficiency Virus, Influenza Virus) or protozoans (e.g., Plasmodium falciparum, 
trypanosomes). Response to trauma, including cerebral trauma (including strokes and ischemias, 

20 encephalitis, encephalopathies, epilepsy, perinatal brain injury, prolonged febrile seizures, SIDS 
and subarachnoid hemorrhage), low birth weight (e.g. cerebral palsy), lung injury (acute 
hemorrhagic lung injury, Goodpasture's syndrome, acute ischemic reperfusion), myocardial 
dysfunction, caused by occupational and environmental pollutants (e.g. susceptibility to toxic oil 
syndrome silicosis), radiation trauma, and efficiency of wound healing responses (e.g. burn or 

25 thermal wounds, chronic wounds, surgical wounds and spinal cord injuries). Susceptibility to 
neoplasias, including breast cancer associated osteolytic metastasis, cachexia, colorectal cancer, 
hyperproliferative diseases, Hodgkin's disease, leukemias, lymphomas, metabolic diseases and 
tumors, metastases, myeolomas, and various cancers (including breast prostate ovarian, colon, 
lung, etc), anorexia and cachexia. Hormonal regulation including fertility/fecundity, likelihood 

30 of a pregnancy, incidence of preterm labor, prenatal and neonatal complications including 
preterm low birth weight, cerebral palsy, septicemia, hypothyroxinernia, oxygen dependence, 
cranial abnormality, early onset menopause. A subject's response to transplant (rejection or 
acceptance), acute phase response (e.g. febrile response), general inflammatory response, acute 
respiratory distress response, acute systemic inflammatory response, wound healing, adhesion, 
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immunomflammatory response, neuroendocrine response, fever development and resistance, 
acute-phase response, stress response, disease susceptibility, repetitive motion stress, tennis 
elbow, and pain management and response. 

The phrases "disruption of the gene" and "targeted disruption" or any similar 
5 phrase refers to the site specific interruption of a native DNA sequence so as to prevent 
expression of that gene in the cell as compared to the wild-type copy of the gene. The 
interruption may be caused by deletions, insertions or modifications to the gene, or any 
combination thereof. 

The term "haplotype" as used herein is intended to refer to a set of alleles that are 
10 inherited together as a group (are in linkage disequilibrium) at statistically significant levels (p corr 

< 0.05). As used herein, the phrase "an IL-1 haplotype" refers to a haplotype in the IL-1 loci. 

An IL-1 inflammatory or proinflammatory haplotype refers to a haplotype that is indicative of 

increased agonist and/or decreased antagonist activities. 

The terms "IL-1 gene cluster" and "IL-1 loci" as used herein include all the 
15 nucleic acid at or near the 2ql3 region of chromosome 2, including at least the IL-1 A, IL-1B and 

IL-1RN genes and any other linked sequences. (Nicklin et aL, Genomics 19: 382-84, 1994). 

The terms "IL-1 A", "IL-1B", and "IL-1RN" as used herein refer to the genes coding for IL-1 , 

IL-1 , and IL-1 receptor antagonist, respectively. The gene accession number for IL-1 A, IL-1B, 

and IL-1RN are X03833, X04500, and X64532, respectively. 
20 "IL-1 functional mutation" refers to a mutation within the IL-1 gene cluster that 

results in an altered phenotype (i.e. effects the function of an IL-1 gene or protein). Examples 

include: IL-lA(+4845) allele 2, IL-1B (+3954) allele 2, IL-1B (+6912) allele 2 and IL-1RN 

(+2018) allele 2. 

"IL-1X (Z) allele Y " refers to a particular allelic form, designated Y, occurring at 
25 an IL-1 locus polymorphic site in gene X, wherein X is IL-1 A, B, or RN and positioned at or 
near nucleotide Z, wherein nucleotide Z is numbered relative to the major transcriptional start 
site, which is nucleotide +1, of the particular IL-1 gene X. As further used herein, the term "IL- 
IX allele (Z)" refers to all alleles of an IL-1 polymorphic site in gene X positioned at or near 
nucleotide Z. For example, the term "IL-1RN (+2018) allele" refers to alternative forms of the 
30 IL-1RN gene at marker +2018. "IL-1RN (+2018) allele 1" refers to a form of the IL-1RN gene 
which contains a cytosine (C) at position +2018 of the sense strand. Clay et aL, Hum. Genet 
97:723-26, 1996. "IL-1RN (+2018) allele 2" refers to a form of the IL-1RN gene which contains 
a thymine (T) at position +2018 of the plus strand. When a subject has two identical IL-1RN 
alleles, the subject is said to be homozygous, or to have the homozygous state. When a subject 
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has two different IL-1RN alleles, the subject is said to be heterozygous, or to have the 
heterozygous state. The term "IL-1RN (+2018) allele 2,2" refers to the homozygous IL-1 RN 
(+2018) allele 2 state. Conversely, the term "IL-1RN (+2018) allele 1,1" refers to the 
homozygous IL-1 RN (+2018) allele 1 state. The term "IL-1RN (+2018) allele 1,2" refers to the 

5 heterozygous allele 1 and 2 state. 

"IL-1 related" as used herein is meant to include all genes related to the human 
IL-1 locus genes on human chromosome 2 (2q 12-14). These include IL-1 genes of the human 
IL-1 gene cluster located at chromosome 2 (2q 13-14) which include: the IL-1 A gene which 
encodes interleukin-lcc, the IL-1B gene which encodes interleukin-1 p, and the IL-1RN (or IL- 

10 Ira) gene which encodes the interleukin-1 receptor antagonist. Furthermore these IL-1 related 
genes include the type I and type II human IL-1 receptor genes located on human chromosome 2 
(2ql2) and their mouse homologs located on mouse chromosome 1 at position 19.5 cM. 
Interleukin-1 a, interleukin-1 P, and interleukin-1 RN are related in so much as they all bind to IL- 
1 type I receptors, however only interleukin-1 a and interleukin-1 P are agonist ligands which 

15 activate IL-1 type I receptors, while interleukin-1 RN is a naturally occurring antagonist ligand. 
Where the term "IL-1" is used in reference to a gene product or polypeptide, it is meant to refer 
to all gene products encoded by the interleukin-1 locus on human chromosome 2 (2q 12-14) and 
their corresponding homologs from other species or functional variants thereof The term IL-1 
thus includes secreted polypeptides which promote an inflammatory response, such as IL-1 a and 

20 IL-1 p, as well as a secreted polypeptide which antagonize inflammatory responses, such as IL-1 
receptor antagonist and the IL-1 type II (decoy) receptor. 

An "IL-1 receptor" or "IL-1R" refers to various cell membrane bound protein 
receptors capable of binding to and/or transducing a signal from an IL-1 locus-encoded ligand. 
The term applies to any of the proteins which are capable of binding interleukin-1 (IL-1) 

25 molecules and, in their native configuration as mammalian plasma membrane proteins, 

presumably play a role in transducing the signal provided by IL-1 to a cell. As used herein, the 
term includes analogs of native proteins with IL-1 -binding or signal transducing activity. 
Examples include the human and murine IL-1 receptors described in U.S. Patent No. 4,968,607. 
The term "IL-1 nucleic acid" refers to a nucleic acid encoding an IL-1 protein. 

30 An "IL-1 polypeptide" and "IL-1 protein" are intended to encompass polypeptides 

comprising the amino acid sequence encoded by the IL-1 genomic DNA sequences shown in 
Figures 1, 2, and 3, or fragments thereof, and homologs thereof and include agonist and 
antagonist polypeptides. 
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"Increased risk" refers to a statistically higher frequency of occurrence of the 
disease or condition in an individual carrying a particular polymorphic allele in comparison to 
the frequency of occurrence of the disease or condition in a member of a population that does not 
carry the particular polymorphic allele. 

The term "interact" as used herein is meant to include detectable relationships or 
associations (e.g. biochemical interactions) between molecules, such as interactions between 
protein-protein, protein-nucleic acid, nucleic acid-nucleic acid and protein-small molecule or 
nucleic acid-small molecule in nature. 

The term "isolated" as used herein with respect to nucleic acids, such as DNA or 
RNA, refers to molecules separated from other DNAs, or RNAs, respectively, that are present in 
the natural source of the macromolecule. For example, an isolated nucleic acid encoding one of 
the subject IL-1 polypeptides preferably includes no more than 10 kilobases (kb) of nucleic acid 
sequence which naturally immediately flanks the IL-1 gene in genomic DNA, more preferably 
no more than 5kb of such naturally occurring flanking sequences, and most preferably less than 
1 .5kb of such naturally occurring flanking sequence. The term isolated as used herein also refers 
to a nucleic acid or peptide that is substantially free of cellular material, viral material, or culture 
medium when produced by recombinant DNA techniques, or chemical precursors or other 
chemicals when chemically synthesized. Moreover, an "isolated nucleic acid" is meant to 
include nucleic acid fragments which are not naturally occurring as fragments and would not be 
found in the natural state. The term "isolated" is also used herein to refer to polypeptides which 
are isolated from other cellular proteins and is meant to encompass both purified and 
recombinant polypeptides. 

A "knock-in" transgenic animal refers to an animal that has had a modified gene 
introduced into its genome and the modified gene can be of exogenous or endogenous origin. 

A "knock-out" transgenic animal refers to an animal in which there is partial or 
complete suppression of the expression of an endogenous gene (e.g, based on deletion of at least 
a portion of the gene, replacement of at least a portion of the gene with a second sequence, 
introduction of stop codons, the mutation of bases encoding critical amino acids, or the removal 
of an intron junction, etc.). 

A "knock-out construct" refers to a nucleic acid sequence that can be used to 
decrease or suppress expression of a protein encoded by endogenous DNA sequences in a cell. Ii 
a simple example, the knock-out construct is comprised of a gene, such as the IL-1RN gene, witl 
a deletion in a critical portion of the gene, so that active protein cannot be expressed therefrom. 
Alternatively, a number of termination codons can be added to the native gene to cause early 
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termination of the protein or an intron junction can be inactivated. In a typical knock-out 
construct, some portion of the gene is replaced with a selectable marker (such as the neo gene) so 
that the gene can be represented as follows: IL-1RN 5Vneo/ IL-1RN 3', where IL-1RN5' and IL- 
1RN 3\ refer to genomic or cDNA sequences which are, respectively, upstream and downstream 
relative to a portion of the IL-1RN gene and where neo refers to a neomycin resistance gene. In 
another knock-out construct, a second selectable marker is added in a flanking position so that 
the gene can be represented as: IL-lRN/neo/IL-lRN/TK, where TK is a thymidine kinase gene 
which can be added to either the IL-1RN5' or the IL-1RN3' sequence of the preceding construct 
and which further can be selected against (i.e. is a negative selectable marker) in appropriate 
media. This two-marker construct allows the selection of homologous recombination events, 
which removes the flanking TK marker, from non-homologous recombination events which 
typically retain the TK sequences. The gene deletion and/or replacement can be from the exons, 
introns, especially intron junctions, and/or the regulatory regions such as promoters. 

"Linkage disequilibrium" refers to co-inheritance of two alleles at frequencies 
greater than would be expected from the separate frequencies of occurrence of each allele in a 
given control population. The expected frequency of occurrence of two alleles that are 
inherited independently is the frequency of the first allele multiplied by the frequency of the 
second allele. Alleles that co-occur at expected frequencies are said to be in "linkage 
disequilibrium" . The cause of linkage disequilibrium is often unclear. It can be due to 
selection for certain allele combinations or to recent admixture of genetically heterogeneous 
populations. In addition, in the case of markers that are very tightly linked to a disease gene, 
an association of an allele (or group of linked alleles) with the disease gene is expected if the 
disease mutation occurred in the recent past, so that sufficient time has not elapsed for 
equilibrium to be achieved through recombination events in the specific chromosomal region. 
When referring to allelic patterns that are comprised of more than one allele, a first allelic 
pattern is in linkage disequilibrium with a second allelic pattern if all the alleles that comprise 
the first allelic pattern are in linkage disequilibrium with at least one of the alleles of the 
second allelic pattern. An example of linkage disequilibrium is that which occurs between the 
alleles at the IL-1RN (+2018) and IL-1RN (VNTR) polymorphic sites. The two alleles at IL- 
1RN (+2018) are 100% in linkage disequilibrium with the two most frequent alleles of IL- 
1RN (VNTR), which are allele 1 and allele 2. 
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The term "marker" refers to a sequence in the genome that is known to vary 
among individuals. For example, the IL-1RN gene has a marker that consists of a variable 
number of tandem repeats (VNTR). 

A "mutated gene" or "mutation" or "functional mutation" refers to an allelic form 
5 of a gene, which is capable of altering the phenotype of a subject having the mutated gene 
relative to a subject which does not have the mutated gene. The altered phenotype caused by a 
mutation can be corrected or compensated for by certain agents. If a subject must be 
homozygous for this mutation to have an altered phenotype, the mutation is said to be recessive. 
If one copy of the mutated gene is sufficient to alter the phenotype of the subject, the mutation is 
1 o said to be dominant. If a subj ect has one copy of the mutated gene and has a phenotype that is 
intermediate between that of a homozygous and that of a heterozygous subject (for that gene), 
the mutation is said to be co-dominant. 

A "non-human animal" of the invention includes mammals such as rodents, non- 
human primates, sheep, dogs, cows, goats, etc. amphibians, such a s members of the Xenopus 
15 genus, and transgenic avians (e.g. chickens, birds, etc.). The term "chimeric animal" is used 
herein to refer to animals in which the recombinant gene is found, or in which the recombinant 
gene is expressed in some but not all cells of the animal. The term "tissue-specific chimeric 
animal" indicates that one of the recombinant IL-1 genes is present and/or expressed or disrupted 
in some tissues but not others. The term "non-human mammal" refers to any member of the 
20 class Mammalia, except for humans. 

As used herein, the term "nucleic acid" refers to polynucleotides or 
oligonucleotides such as deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic 
acid (RNA). The term should also be understood to include, as equivalents, analogs of either 
RNA or DNA made from nucleotide analogs (e.g. peptide nucleic acids) and as applicable to 
25 the embodiment being described, single (sense or antisense) and double-stranded 
polynucleotides . 

The term "polymorphism" refers to the coexistence of more than one form of a 
gene or portion (e.g., allelic variant) thereof. A portion of a gene of which there are at least two 
different forms, i.e., two different nucleotide sequences, is referred to as a "polymorphic region 
30 of a gene" . A specific genetic sequence at a polymorphic region of a gene is an allele. A 

polymorphic region can be a single nucleotide, the identity of which differs in different alleles. 
A polymorphic region can also be several nucleotides long. 
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The term "propensity to disease," also "predisposition" or "susceptibility" to 
disease or any similar phrase, means that certain alleles are hereby discovered to be associated 
with or predictive of a subject's incidence of developing a particular disease (e.g. a vascular 
disease). The alleles are thus over-represented in frequency in individuals with disease as 
5 compared to healthy individuals. Thus, these alleles can be used to predict disease even in 
pre-symptomatic or pre-diseased individuals. 

"Small molecule" as used herein, is meant to refer to a composition, which has a 
molecular weight of less than about 5kD and most preferably less than about 4kD. Small 
molecules can be nucleic acids, peptides, peptidomimetics, carbohydrates, lipids or other organic 
10 or inorganic molecules. 

As used herein, the term "specifically hybridizes" or "specifically detects" refers 
to the ability of a nucleic acid molecule to hybridize to at least approximately 6 consecutive 
nucleotides of a sample nucleic acid. 

"Transcriptional regulatory sequence" is a generic term used throughout the 
15 specification to refer to DNA sequences, such as initiation signals, enhancers, and promoters, 
which induce or control transcription of protein coding sequences with which they are operably 
linked. 

As used herein, the term "transgene" means a nucleic acid sequence (encoding, 
e.g., one of the IL-1 polypeptides, or an antisense transcript thereto) which has been introduced 

20 into a cell. A transgene could be partly or entirely heterologous, i.e., foreign, to the transgenic 
animal or cell into which it is introduced, or, is homologous to an endogenous gene of the 
transgenic animal or cell into which it is introduced, but which is designed to be inserted, or is 
inserted, into the animal's genome in such a way as to alter the genome of the cell into which it is 
inserted (e.g., it is inserted at a location which differs from that of the natural gene or its insertion 

25 results in a knockout). A transgene can also be present in a cell in the form of an episome. A 
transgene can include one or more transcriptional regulatory sequences and any other nucleic 
acid, such as introns, that may be necessary for optimal expression of a selected nucleic acid. 

A "transgenic animal" refers to any animal, preferably a non-human mammal, 
bird or an amphibian, in which one or more of the cells of the animal contain heterologous 

30 nucleic acid introduced by way of human intervention, such as by transgenic techniques well 
known in the art. The nucleic acid is introduced into the cell, directly or indirectly by 
introduction into a precursor of the cell, by way of deliberate genetic manipulation; such as by 
microinjection or by infection with a recombinant virus. The term genetic manipulation does not 
include classical cross-breeding, or in vitro fertilization, but rather is directed to the introduction 
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of a recombinant DNA molecule. This molecule may be integrated within a chromosome, or it 
may be extrachromosomally replicating DNA. In the typical transgenic animals described 
herein, the transgene causes cells to express a recombinant form of one of an IL-1 polypeptide, 
e.g. either agonistic or antagonistic forms. However, transgenic animals in which the 

5 recombinant gene is silent are also contemplated, as for example, the FLP or CRE recombinase 
dependent constructs described below. Moreover, "transgenic animal" also includes those 
recombinant animals in which gene disruption of one or more genes is caused by human 
intervention, including both recombination and antisense techniques. The term is intended to 
include all progeny generations. Thus, the founder animal and all Fl, F2, F3, and so on, progeny 

10 thereof are included. 

The term "treating" as used herein is intended to encompass curing as well as 
ameliorating at least one symptom of a condition or disease. 

The term "vector" refers to a nucleic acid molecule, which is capable of 
transporting another nucleic acid to which it has been linked. One type of preferred vector is an 

15 episome, i.e., a nucleic acid capable of extra-chromosomal replication. Preferred vectors are 
those capable of autonomous replication and/or expression of nucleic acids to which they are 
linked. Vectors capable of directing the expression of genes to which they are operatively linked 
are referred to herein as "expression vectors". In general, expression vectors of utility in 
recombinant DNA techniques are often in the form of "plasmids" which refer generally to 

20 circular double stranded DNA loops which, in their vector form are not bound to the 

chromosome. In the present specification, "plasmid" and "vector" are used interchangeably as the 
plasmid is the most commonly used form of vector. However, the invention is intended to 
include such other forms of expression vectors which serve equivalent functions and which 
become known in the art subsequently hereto. 

25 The term "wild-type allele" refers to an allele of a gene which, when present in 

two copies in a subject results in a wild-type phenotype. There can be several different wild-type 
alleles of a specific gene, since certain nucleotide changes in a gene may not affect the phenotype 
of a subject having two copies of the gene with the nucleotide changes. 

30 4.2 Predictive Medicine 

4.2. 1 . IL-1 Inflammatory Haplotypes and Their Association with Certain 
Diseases or Conditions. 

The present invention is based at least in part, on the identification of certain 
35 inflammatory haplotype patterns and the association (to a statistically significant extent) of these 
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pattems with the development of certain diseases or conditions. Therefore, detection of the 
alleles comprising a haplotype, alone or in conjunction with another means in a subject can 
indicate that the subject has or is predisposed to the development of a particular disease or 
condition. However, because these alleles are in linkage disequilibrium with other alleles, the 
5 detection of such other linked alleles can also indicate that the subj ect has or is predisposed to 
the development of a particular disease or condition. For example, the 441 12332 haplotype 
comprises the following genotype: 

10 allele 4 of the 222/223 marker of IL-1 A 

allele 4 of the gz5/gz6 marker of IL-1 A 

allele 1 of the -889 marker of IL-1 A 

allele 1 of the +3954 marker of IL-1B 

allele 2 of the -5 1 1 marker of IL-1B 

15 allele 3 of the gaat.p33330 marker 

allele 3 of the Y31 marker 

allele 2 of +2018 of IL-1RN 

all ele 1 of +4845 of IL-1 A 

allele 2 of the VNTR marker of IL-1RN 

20 

Three other polymorphisms in an IL-1RN alternative exon (Exon lie, which 
produces an intracellular form of the gene product) are also in linkage disequilibrium with allele 
2 of IL-1RN (VNTR) (Clay et al., (1996) Hum Genet 97:723-26). These include: IL-1RN exon 
lie (1812) (GenBank:X77090 at 1812); the IL-1RN exon lie (1868) polymorphism 

25 (GenBank:X77090 at 1868); and the IL-1RN exon lie (1887) polymorphism (GenBank:X77090 
at 1 887). Furthermore yet another polymorphism in the promoter for the alternatively spliced 
intracellular form of the gene, the Pic (1731) polymorphism (GenBank:X77090 at 1731), is also 
in linkage disequilibrium with allele 2 of the IL-1RN (VNTR) polymorphic locus. For each of 
these polymorphic loci, the allele 2 sequence variant has been determined to be in linkage 

30 disequilibrium with allele 2 of the IL-1RN (VNTR) locus (Clay et al., (1996) Hum Genet 
97:723-26). 
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The 33221461 haplotype comprises the following genotype: 



allele 3 of the 222/223 marker of IL-1 A 
allele 3 of the gz5/gz6 marker of IL-1 A 
allele 2 of the -889 marker of IL-1A 
allele 2 of the +3954 marker of IL-1B 
allele 1 of the -511 marker of IL-1B 
allele 4 of the gaat.p33330 marker 
allele 6 of the Y3 1 marker 

allele 1 of +2018 of IL-1RN 
allele 2 of +4845 of IL-1 A 

allele 1 of the VNTR marker of IL-1RN 



15 Individuals with the 441 12332 haplotype are typically overproduces of both IL- 

la and IL-ip proteins, upon stimulation. In contrast, individuals with the 33221461 haplotype 
are typically underproduces of IL-1 ra. Each haplotype results in a net proinflammatory 
response. Each allele within a haplotype may have an effect, as well as a composite genotype 
effect. In addition, particular diseases may be associated with both haplotype patterns. 

20 

The following Table 1 sets forth a number of genotype markers and various 
diseases and conditions to which these markers have been found to be associated to a statistically 
significant extent. 

TABLE 1 

25 Association Of IL-1 Haplotype Gene Markers With Certain Diseases 



GENOTYPE 


IL-1A 


IL-1 A 


IL-1B 


IL-1B 


IL-1RN 




(-889) 


(+4845) 


(-511) 


(+3954) 


(+2018) 














DISEASE 












Periodontal Disease 


(*2) 


*2 




*2 




Coronary Artery Disease 






*2 




*2 


Atherosclerosis 












Osteoporosis 










*2 


Insulin dependent diabetes 








*2 




Diabetic retinopathy 










*1 


Endstage renal diseases 










(+) 


Diabetic nephropathy 










*2 


Hepatic fibrosis (Japanese 
alcoholics) 










(+) 


Alopecia areata 










*2 


Graves' disease 










- * 2 


Graves' ophthalmopathy 










(") 


Extrathyroid disease 










(+) 



5 



10 
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OFNOTYPF 

VJ JU/1 N KJ 1 I ST Hf 


TI -1 A 


IT -1 A 


IL-1B 


TT -IB 


IL-1RN 




(-889) 


(+4845) 


(-511) 


(+3954) 


(+2018) 


Systemic Lupus 
Erythematosus 










*2 


Lichen Sclerosis 










*2 


Arthritis 










(+> 


Juvenile chronic arthritis 


*2 










Rheumatoid arthritis 










(+) 


Insulin dependent diabetes 








*2 


*2 VNTR 


Ulcerative colitis 










*2 


Asthma 






*2 


*2 




Multiple sclerosis 








(*2) 


*2VNTR 


Menopause, early onset 










*2 



In addition to the allelic patterns described above, as described herein, one of skill 
in the art can readily identify other alleles (including polymorphisms and mutations) that are in 

15 linkage disequilibrium with an allele associated with a disease or disorder. For example, a 

nucleic acid sample from a first group of subjects without a particular disorder can be collected, 
as well as DNA from a second group of subjects with the disorder. The nucleic acid sample can 
then be compared to identify those alleles that are over-represented in the second group as 
compared with the first group, wherein such alleles are presumably associated with a disorder, 

20 which is caused or contributed to by inappropriate interleukin 1 regulation. Alternatively, alleles 
that are in linkage disequilibrium with an allele that is associated with the disorder can be 
identified, for example, by genotyping a large population and performing statistical analysis to 
determine which alleles appear more commonly together than expected. Preferably the group is 
chosen to be comprised of genetically related individuals. Genetically related individuals 

25 include individuals from the same race, the same ethnic group, or even the same family. As the 
degree of genetic relatedness between a control group and a test group increases, so does the 
predictive value of polymorphic alleles which are ever more distantly linked to a disease-causing 
allele. This is because less evolutionary time has passed to allow polymorphisms which are 
linked along a chromosome in a founder population to redistribute through genetic cross-over 

30 events. Thus race-specific, ethnic-specific, and even family-specific diagnostic genotyping 
assays can be developed to allow for the detection of disease alleles which arose at ever more 
recent times in human evolution, e.g., after divergence of the major human races, after the 
separation of human populations into distinct ethnic groups, and even within the recent history of 
a particular family line. 

35 Linkage disequilibrium between two polymorphic markers or between one 

polymorphic marker and a disease-causing mutation is a meta-stable state. Absent selective 
pressure or the sporadic linked reoccurrence of the underlying mutational events,ihe 
polymorphisms will eventually become disassociated by chromosomal recombination events and 
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will thereby reach linkage equilibrium through the course of human evolution. Thus, the 
likelihood of finding a polymorphic allele in linkage disequilibrium with a disease or condition 
may increase with changes in at least two factors: decreasing physical distance between the 
polymorphic marker and the disease-causing mutation, and decreasing number of meiotic 
5 generations available for the dissociation of the linked pair. Consideration of the latter factor 
suggests that, the more closely related two individuals are, the more likely they will share a 
common parental chromosome or chromosomal region containing the linked polymorphisms and 
the less likely that this linked pair will have become unlinked through meiotic cross-over events 
occurring each generation. As a result, the more closely related two individuals are, the more 

10 likely it is that widely spaced polymorphisms may be co-inherited. Thus, for individuals related 
by common race, ethnicity or family, the reliability of ever more distantly spaced polymorphic 
loci can be relied upon as an indicator of inheritance of a linked disease-causing mutation. 

Appropriate probes may be designed to hybridize to a specific gene of the IL-1 
locus, such as IL-1 A, IL-1B or IL-1RN or a related gene. These genomic DNA sequences are 

15 shown in Figures 3, 4 and 5, respectively, and further correspond to SEQ ID Nos. 1, 2 and 3, 
respectively. Alternatively, these probes may incorporate other regions of the relevant genomic 
locus, including intergenic sequences. Indeed the IL-1 region of human chromosome 2 spans 
some 400,000 base pairs and, assuming an average of one single nucleotide polymorphism every 
1,000 base pairs, includes some 400 SNPs loci alone. Yet other polymorphisms available for use 

20 with the immediate invention are obtainable from various public sources. For example, the 
human genome database collects intragenic SNPs, is searchable by sequence and currently 
contains approximately 2,700 entries (http://hgbase.interactiva.de). Also available is a human 
polymorphism database maintained by the Massachusetts Institute of Technology (MIT SNP 
database (http://www.genome.wi.mit.edu/SNP/human/index.html)). From such sources SNPs as 

25 well as other human polymorphisms may be found. 

For example, examination of the IL-1 region of the human genome in any one of 
these databases reveals that the IL-1 locus genes are flanked by a centromere proximal 
polymorphic marker designated microsatellite marker AFM220ze3 at 127.4 cM (centiMorgans) 
(see GenBank Acc. No. Z17008) and a distal polymorphic marker designated microsatellite 

30 anchor marker AFM087xal at 127.9 cM (see GenBank Acc. No. Z16545). These human 

polymorphic loci are both CA dinucleotide repeat microsatellite polymorphisms, and, as such, 
show a high degree of heterozygosity in human populations. For example, one allele of 
AFM220ze3 generates a 21 1 bp PCR amplification product with a 5' primer of the sequence 
TGTACCTAAGCCCACCCTTTAGAGC (SEQ ID No. 4) and a 3' primer of the sequence 
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TGGCCTCCAGAAACCTCCAA (SEQ ID No. 5). Furthermore, one allele of AFM087xal 
generates a 177 bp PCR amplification product with a 5 ? primer of the sequence 
GCTGATATTCTGGTGGGAAA (SEQ ID No. 6) and a 3' primer of the sequence 
GGCAAGAGCAAAACTCTGTC (SEQ ID No. 7). Equivalent primers corresponding to unique 
5 sequences occurring 5' and 3 ! to these human chromosome 2 CA dinucleotide repeat 

polymorphisms will be apparent to one of skill in the art. Reasonable equivalent primers include 
those which hybridize within about 1 kb of the designated primer, and which further are 
anywhere from about 17 bp to about 27 bp in length. A general guideline for designing primers 
for amplification of unique human chromosomal genomic sequences is that they possess a 

10 melting temperature of at least about 50°C, wherein an approximate melting temperature can be 
estimated using the formula T melt = [2 x (# of A or T) + 4 x (# of G or C)]. 

A number of other human polymorphic loci occur between these two CA 
dinucleotide repeat polymorphisms and provide additional targets for determination of a 
prognostic allele in a family or other group of genetically related individuals. For example, the 

15 National Center for Biotechnology Information web site (www.ncbi.nlm.nih.gov/genemap/) lists 
a number of polymorphism markers in the region of the IL-1 locus and provides guidance in 
designing appropriate primers for amplification and analysis of these markers. 

Accordingly, the nucleotide segments of the invention may be used for their 
ability to selectively form duplex molecules with complementary stretches of human 

20 chromosome 2 q 12-13 or cDNAs from that region or to provide primers for amplification of 
DNA or cDNA from this region. The design of appropriate probes for this purpose requires 
consideration of a number of factors. For example, fragments having a length of between 10, 15, 
or 18 nucleotides to about 20, or to about 30 nucleotides, will find particular utility. Longer 
sequences, e.g., 40, 50, 80, 90, 100, even up to full length, are even more preferred for certain 

25 embodiments. Lengths of oligonucleotides of at least about 18 to 20 nucleotides are well 

accepted by those of skill in the art as sufficient to allow sufficiently specific hybridization so as 
to be useful as a molecular probe. Furthermore, depending on the application envisioned, one 
will desire to employ varying conditions of hybridization to achieve varying degrees of 
selectivity of probe towards target sequence. For applications requiring high selectivity, one will 

30 typically desire to employ relatively stringent conditions to form the hybrids. For example, 
relatively low salt and/or high temperature conditions, such as provided by 0.02 M-0.15M NaCl 
at temperatures of about 50° C to about 70° C. Such selective conditions may tolerate little, if 
any, mismatch between the probe and the template or target strand. 
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Other alleles or other indicia of a disorder can be detected or monitored in a 
subject in conjunction with detection of the alleles described above, for example, identifying 
vessel wall thickness (e.g. as measured by ultrasound), or whether the subject smokes, drinks is 
overweight, is under stress or exercises. 

5 

4.2.2 Detection of Alleles 

Many methods are available for detecting specific alleles at human polymorphic 
loci. The preferred method for detecting a specific polymorphic allele will depend, in part, upon 
the molecular nature of the polymorphism. For example, the various allelic forms of the 

10 polymorphic locus may differ by a single base-pair of the DNA. Such single nucleotide 

polymorphisms (or SNPs) are major contributors to genetic variation, comprising some 80% of 
all known polymorphisms, and their density in the human genome is estimated to be on average 
1 per 1,000 base pairs. SNPs are most frequently biallelic- occurring in only two different forms 
(although up to four different forms of an SNP, corresponding to the four different nucleotide 

15 bases occurring in DNA, are theoretically possible). Nevertheless, SNPs are mutationally more 
stable than other polymorphisms, making them suitable for association studies in which linkage 
disequilibrium between markers and an unknown variant is used to map disease-causing 
mutations. In addition, because SNPs typically have only two alleles, they can be genotyped by 
a simple plus/minus assay rather than a length measurement, making them more amenable to 

20 automation. 

A variety of methods are available for detecting the presence of a particular single 
nucleotide polymorphic allele in an individual. Advancements in this field have provided 
accurate, easy, and inexpensive large-scale SNP genotyping. Most recently, for example, several 
new techniques have been described including dynamic allele-specific hybridization (DASH), 

25 microplate array diagonal gel electrophoresis (MADGE), pyrosequencing, oligonucleotide- 
specific ligation, the TaqMan system as well as various DNA "chip" technologies such as the 
Affymetrix SNP chips. These methods require amplification of the target genetic region, 
typically by PCR. Still other newly developed methods, based on the generation of small signal 
molecules by invasive cleavage followed by mass spectrometry or immobilized padlock probes 

30 and rolling-circle amplification, might eventually eliminate the need for PCR. Several of the 

methods known in the art for detecting specific single nucleotide polymorphisms are summarized 
below. The method of the present invention is understood to include all available methods. 

Several methods have been developed to facilitate analysis of single nucleotide 
polymorphisms. In one embodiment, the single base polymorphism can be detected by using a 
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specialized exonuclease-resistant nucleotide, as disclosed, e.g., in Mimdy, C. R. (U.S. Pat, 
No.4,656,127). According to the method, a primer complementary to the allelic sequence 
immediately 3' to the polymorphic site is permitted to hybridize to a target molecule obtained 
from a particular animal or human. If the polymorphic site on the target molecule contains a 
5 nucleotide that is complementary to the particular exonuclease-resistant nucleotide derivative 
present, then that derivative will be incorporated onto the end of the hybridized primer. Such 
incorporation renders the primer resistant to exonuclease, and thereby permits its detection. 
Since the identity of the exonuclease-resistant derivative of the sample is known, a finding that 
the primer has become resistant to exonucleases reveals that the nucleotide present in the 

10 polymorphic site of the target molecule was complementary to that of the nucleotide derivative 
used in the reaction. This method has the advantage that it does not require the determination of 
large amounts of extraneous sequence data. 

In another embodiment of the invention, a solution-based method is used for 
determining the identity of the nucleotide of a polymorphic site. Cohen, D. et al. (French Patent 

15 2,650,840; PCT Appln. No. WO91/02087). As in the Mundy method of U.S. Pat. No. 

4,656,127, a primer is employed that is complementary to allelic sequences immediately 3' to a 
polymorphic site. The method determines the identity of the nucleotide of that site using labeled 
dideoxynucleotide derivatives, which, if complementary to the nucleotide of the polymorphic 
site will become incorporated onto the terminus of the primer. 

20 An alternative method, known as Genetic Bit Analysis or GBA ™ is described by 

Goelet, P. et al. (PCT Appln. No. 92/15712). The method of Goelet, P. et aL uses mixtures of 
labeled terminators and a primer that is complementary to the sequence 3' to a polymorphic site. 
The labeled terminator that is incorporated is thus determined by, and complementary to, the 
nucleotide present in the polymorphic site of the target molecule being evaluated. In contrast to 

25 the method of Cohen et al. (French Patent 2,650,840; PCT Appln. No. WO91/02087) the method 
of Goelet, P. et al. is preferably a heterogeneous phase assay, in which the primer or the target 
molecule is immobilized to a solid phase. 

Recently, several primer-guided nucleotide incorporation procedures for assaying 
polymorphic sites in DNA have been described (Komher, J. S. et al., Nucl. Acids. Res. 

30 17:7779-7784 (1989); Sokolov, B. P., NucL Acids Res. 18:3671 (1990); Syvanen, A. -C, et al., 
Genomics 8:684-692 (1990); Kuppuswamy, M. N. et al., Proc. Natl. Acad. Sci. (U.S.A.) 
88:1143-1147 (1991); Prezant, T. R. et al., Hum. Mutat. 1:159-164 (1992); Ugozzoli, L. et al., 
GATA 9:107-1 12 (1992); Nyren, P. et aL, Anal. Biochem. 208:171-175 (1993)). These methods 
differ from GBA ™ in that they all rely on the incorporation of labeled deoxynucleotides to 



10 



15 



20 



25 



30 



-25- 



discriminate between bases at a polymorphic site. In such a format, since the signal is 
proportional to the number of deoxynucleotides incorporated, polymorphisms that occur in runs 
of the same nucleotide can result in signals tha t are proportional to the length of the run 



(Syvanen, A. -C, et ah, Amer. J. Hum. Genet. 



52:46-59(1993)). 



For mutations that produce premature termination of protein translation, the 
protein truncation test (PTT) offers an efficient diagnostic approach (Roest, et. aL, (1993) Hum. 
MoL Genet 2:1719-21; van der Luijt, et. al., (1994) Genomics 20:1-4). For PTT, RNA is 
initially isolated from available tissue and reverse-transcribed, and the segment of interest is 
amplified by PCR. The products of reverse transcription PCR are then used as a template for 
nested PCR amplification with a primer that contains an RNA polymerase promoter and a 
sequence for initiating eukaryotic translation. After amplification of the region of interest, the 
unique motifs incorporated into the primer pemit sequential in vitro transcription and translation 
of the PCR products. Upon sodium dodecyl sulfate-polyacrylamide gel electrophoresis of 
translation products, the appearance of truncated polypeptides signals the presence of a mutation 
that causes premature termination of translation. In a variation of this technique, DNA (as 
opposed to RNA) is used as a PCR template when the target region of interest is derived from a 



single exon. 



Any cell type or tissue may be 



utilized to obtain nucleic acid samples for use in 
the diagnostics described herein. In a preferred embodiment, the DNA sample is obtained from a 
bodily fluid, e.g, blood, obtained by known techniques (e.g. venipuncture) or saliva. 
Alternatively, nucleic acid tests can be performed on dry samples (e.g. hair or skin). When using 
RNA or protein, the cells or tissues that may be utilized must express an IL-1 gene. 

Diagnostic procedures may also be performed in situ directly upon tissue sections 
(fixed and/or frozen) of patient tissue obtained from biopsies or resections, such that no nucleic 
acid purification is necessary. Nucleic acid reagents may be used as probes and/or primers for 
such in situ procedures (see, for example, Nuovo, GJ., 1992, PCR in situ hybridization: 
protocols and applications, Raven Press, NY). 

In addition to methods which focus primarily on the detection of one nucleic acid 
sequence, profiles may also be assessed in such detection schemes. Fingerprint profiles may be 
generated, for example, by utilizing a differential display procedure, Northern analysis and/or 



RT-PCR. 



A preferred detection method is allele specific hybridization using probes 



overlapping a region of at least one allele o 



an IL-1 proinflammatory haplotype and having 



about 5, 10, 20, 25, or 30 nucleotides around the mutation or polymorphic region. In a preferred 
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embodiment of the invention, several probes capable of hybridizing specifically to other allelic 



variants involved in a restenosis are attached to 
hold up to about 250,000 oligonucleotides). O 



(1996) Human Mutation 7:244. In one embod: 



a solid phase support, e.g., a "chip" (which can 
igonucleotides can be bound to a solid support by 



a variety of processes, including lithography. Mutation detection analysis using these chips 
comprising oligonucleotides, also termed "DNA probe arrays" is described e.g., in Cronin et ah 



ment, a chip comprises all the allelic variants of at 



least one polymorphic region of a gene. The s olid phase support is then contacted with a test 
nucleic acid and hybridization to the specific probes is detected. Accordingly, the identity of 
numerous allelic variants of one or more gene^ can be identified in a simple hybridization 
experiment. 

These techniques may also comprise the step of amplifying the nucleic acid 



before analysis. Amplification techniques are 



known to those of skill in the art and include, but 



are not limited to cloning, polymerase chain reaction (PCR), polymerase chain reaction of 
specific alleles (ASA), ligase chain reaction (LCR), nested polymerase chain reaction, self 

15 sustained sequence replication (Guatelli, J.C. st al., 1990, Proc. Natl. Acad. Sci. USA 87:1874- 
1878), transcriptional amplification system (Kwoh, D.Y. et al., 1989, Proc. Natl. Acad. Sci. USA 
86:1 173-1 177), and Q- Beta Replicase (Lizardi, P.M. et al., 1988, Bio/Technology 6:1 197). 

Amplification products may be assayed in a variety of ways, including size 
analysis, restriction digestion followed by sizs analysis, detecting specific tagged oligonucleotide 

20 primers in the reaction products, allele-specif :c oligonucleotide (ASO) hybridization, allele 



specific 5' exonuclease detection, sequencing 



hybridization, and the like. 



PCR based detection means can include multiplex amplification of a plurality of 



markers simultaneously. For example, it is w< 
generate PCR products that do not overlap in 



means allow the differential detection ofmu' 
are known in the art to allow multiplex anal) 



ell known in the art to select PCR primers to 
size and can be analyzed simultaneously. 
Alternatively, it is possible to amplify different markers with primers that are differentially 
labeled and thus can each be differentially detected. Of course, hybridization based detection 

tiple PCR products in a sample. Other techniques 
ses of a plurality of markers. 
In a merely illustrative embodiment, the method includes the steps of (i) 
collecting a sample of cells from a patient, (ii) isolating nucleic acid (e.g., genomic, mRNA or 
both) from the cells of the sample, (iii) contacting the nucleic acid sample with one or more 
primers which specifically hybridize 5' and 3' to at least one allele of an IL-1 proinflammatory 
haplotype under conditions such that hybridization and amplification of the allele occurs, and 



(iv) detecting the amplification product. These detection schemes are especially useful for the 
detection of nucleic acid molecules if such molecules are present in very low numbers. 

In a preferred embodiment of the subject assay, the allele of an IL-1 
proinflammatory haplotype is identified by alterations in restriction enzyme cleavage patterns. 
5 For example, sample and control DNA is isolated, amplified (optionally), digested with one or 
more restriction endonucleases, and fragment length sizes are determined by gel electrophoresis. 

In yet another embodiment, any cf a variety of sequencing reactions known in the 
art can be used to directly sequence the allele. Exemplary sequencing reactions include those 
based on techniques developed by Maxim and G ilbert ((1977) Proc. Natl Acad Sci USA 74:560) 
10 or Sanger (Sanger et al (1977) Proc. Nat. Acad. Sci USA 74:5463). It is also contemplated that 
any of a variety of automated sequencing procedures may be utilized when performing the 
subject assays (see, for example Biotechniques I 
spectrometry (see, for example PCT publication * 



1995) 19:448), including sequencing by mass 
WO 94/16101; Cohen et al. (1996) Adv 



Chromatogr 36:127-162; and Griffin et al. (199B) Appl Biochem Biotechnol 38:147-159). It will 

15 be evident to one of skill in the art that, for certain embodiments, the occurrence of only one, two 
or three of the nucleic acid bases need be determined in the sequencing reaction. For instance, 
A-track or the like, e.g., where only one nucleic; acid is detected, can be carried out. 

In a further embodiment, protection from cleavage agents (such as a nuclease, 
hydroxylamine or osmium tetroxide and with piperidine) can be used to detect mismatched bases 

20 in RNA/RNA or RNA/DNA or DNA/DNA heteroduplexes (Myers, et al. (1985) Science 
230:1242). In general, the art technique of "mismatch cleavage" starts by providing 
heteroduplexes formed by hybridizing (labeled) RNA or DNA containing the wild-type allele 
with the sample. The double-stranded duplexe s are treated with an agent which cleaves single- 
stranded regions of the duplex such as which will exist due to base pair mismatches between the 

25 control and sample strands. For instance, RNvWDNA duplexes can be treated with RNase and 
DNA/DNA hybrids treated with SI nuclease to enzymatically digest the mismatched regions. In 
other embodiments, either DNA/DNA or RN^t/DNA duplexes can be treated with 
hydroxylamine or osmium tetroxide and with piperidine in order to digest mismatched regions. 
After digestion of the mismatched regions, the resulting material is then separated by size on 

30 denaturing polyacrylamide gels to determine t he site of mutation. See, for example, Cotton et al 
(1988) Proc. Natl Acad Sci USA 85:4397; and Saleeba et al (1992) Methods EnzymoL 217:286- 
295. In a preferred embodiment, the control DNA or RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or more 
proteins that recognize mismatched base pains in double-stranded DNA (so called "DNA 
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mismatch repair" enzymes). For example, the mutY enzyme of E. coli cleaves A at G/A 
mismatches and the thymidine DNA glycosylase from HeLa cells cleaves T at G/T mismatches 
(Hsu et al (1994) Carcinogenesis 15:1657-1662). According to an exemplary embodiment, a 
probe based on an allele of an IL-1 locus haplot}pe is hybridized to a cDNA or other DNA 
product from a test cell(s). The duplex is treated with a DNA mismatch repair enzyme, and the 
cleavage products, if any, can be detected from electrophoresis protocols or the like. See, for 
example, U.S. Patent No. 5,459,039. 

In other embodiments, alteration > in electrophoretic mobility will be used to 
identify an IL-1 locus allele. For example, single strand conformation polymorphism (SSCP) 
may be used to detect differences in electrophoretic mobility between mutant and wild type 
nucleic acids (Orita et al. (1989) Proc Natl. Acad. Sci USA 86:2766, see also Cotton (1993) 
Mutat Res 285:125-144; and Hayashi (1992) Gbnet Anal Tech Appl 9:73-79). Single-stranded 
DNA fragments of sample and control IL-1 loc as alleles are denatured and allowed to renature. 
The secondary structure of single-stranded mid eic acids varies according to sequence, the 
resulting alteration in electrophoretic mobility enables the detection of even a single base change. 
The DNA fragments may be labeled or detected with labeled probes. The sensitivity of the assay 
may be enhanced by using RNA (rather than DNA), in which the secondary structure is more 
sensitive to a change in sequence. In a preferred embodiment, the subject method utilizes 
heteroduplex analysis to separate double stranded heteroduplex molecules on the basis of 

. (1991) Trends Genet 7:5). 
movement of alleles in polyacrylamide gels 
using denaturing gradient gel electrophoresis 
. When DGGE is used as the method of analysis, 



changes in electrophoretic mobility (Keen et a 
In yet another embodiment, the 
containing a gradient of denaturant is assayed 
(DGGE) (Myers et al. (1985) Nature 313:495) 
DNA will be modified to insure that it does not completely denature, for example by adding a 



25 GC clamp of approximately 40 bp of high-me 



tingGC-richDNAbyPCR. In a further 



embodiment, a temperature gradient is used in place of a denaturing agent gradient to identify 
differences in the mobility of control and sample DNA (Rosenbaum and Reissner (1987) 
Biophys Chem 265:12753). 

Examples of other techniques for detecting alleles include, but are not limited to, 
30 selective oligonucleotide hybridization, selec ;ive amplification, or selective primer extension. 
For example, oligonucleotide primers may be prepared in which the known mutation or 
nucleotide difference (e.g., in allelic variants} is placed centrally and then hybridized to target 
DNA under conditions which permit hybridization only if a perfect match is found (Saiki et al. 
(1986) Nature 324:163); Saiki et al (1989) pJoc. Natl Acad. Sci USA 86:6230). Such allele 
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specific oligonucleotide hybridization techniques may be used to test one mutation or 
polymorphic region per reaction when oligonucleotides are hybridized to PCR amplified target 
DNA or a number of different mutations or polymorphic regions when the oligonucleotides are 
attached to the hybridizing membrane and hybridized with labelled target DNA. 



Alternatively, allele specific amp 



the center of the molecule (so that amplification 



ification technology which depends on selective 



PCR amplification may be used in conjunction with the instant invention. Oligonucleotides used 
as primers for specific amplification may carry tie mutation or polymorphic region of interest in 



al (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme 3' end of one primer where, under 
appropriate conditions, mismatch can prevent, o r reduce polymerase extension (Prossner (1993) 



Tibtech 1 1 :238. In addition it may be desirable 



depends on differential hybridization) (Gibbs et 



to introduce a novel restriction site in the region 



of the mutation to create cleavage-based detection (Gasparini et al (1992) Mol. Cell Probes 6:1). 
It is anticipated that in certain embodiments amplification may also be performed using Taq 
ligase for amplification (Barany (1991) Proc. Natl. Acad. Sci USA 88:189). In such cases, 
ligation will occur only if there is a perfect match at the 3' end of the 5* sequence making it 
possible to detect the presence of a known mutation at a specific site by looking for the presence 
or absence of amplification. 

In another embodiment, identific ation of the allelic variant is carried out using an 
oligonucleotide ligation assay (OLA), as described, e.g., in U.S. Pat. No. 4,998,617 and in 
Landegren, U. et al ((1988) Science 241:1077-1080). The OLA protocol uses two 
oligonucleotides which are designed to be capa ble of hybridizing to abutting sequences of a 
single strand of a target. One of the oligonucleqtides is linked to a separation marker, e.g,. 
biotinylated, and the other is detectably labeled. If the precise complementary sequence is found 



in a target molecule, the oligonucleotides will 



Lybridize such that their termini abut, and create a 



25 ligation substrate. Ligation then permits the labeled oligonucleotide to be recovered using avidin, 
or another biotin ligand. Nickerson, D. A. et al. have described a nucleic acid detection assay that 
combines attributes of PCR and OLA (Nickers on, D. A. et al. (1990) Proc. Natl. Acad. Sci. USA 
87:8923-27). In this method, PCR is used to achieve the exponential amplification of target 



DNA, which is then detected using OLA. 

Several techniques based on thib OLA method have been developed and can be 
used to detect alleles of an IL-1 locus haplotype. For example, U.S. Patent No. 5,593,826 
discloses an OLA using an oligonucleotide having 3'-amino group and a 5'-phosphorylated 
oligonucleotide to form a conjugate having a phosphoramidate linkage. In another variation of 
OLA described in Tobe et al. ((1996) Nucleic Acids Res 24: 3728), OLA combined with PCR 



10 



permits typing of two alleles in a single 
primers with a unique hapten, i.e. digoxigenin 
detected by using hapten specific antibodies 
alkaline phosphatase or horseradish peroxidas 
alleles using a high throughput format that 

Another embodiment of the i 
predisposition for developing a restenosis 
including 5 ? and 3' oligonucleotides that 
haplotype. PCR amplification oligonucleotides 
apart, preferably between about 100 and 
product of convenient size for subsequent 

Particularly preferred primers 
include SEQ ID Nos?§^ 



microliter well. By marking each of the allele-specific 
and fluorescein, each OLA reaction can be 
tjtiat are labeled with different enzyme reporters, 
e. This system permits the detection of the two 
to the production of two different colors, 
indention is directed to kits for detecting a 

kit may contain one or more oligonucleotides, 
hybridize 5' and V to at least one allele of an IL-1 locus 
should hybridize between 25 and 2500 base pairs 
aboxit 500 bases apart, in order to produce a PCR 
lysis. 

for use in the diagnostic method of the invention 



leads 



t ana 



the; 



The design of additional olig< 
15 detection of IL-1 polymorphic alleles by the 

availability of both updated sequence information 
contains the human IL-1 locus, and updated 
locus. For example, the DNA sequence for 
(GenBank Accession No. X03833), 2 (GenBjmk 
20 Accession No. X64532) respectively. Suitable 
polymorphism in these genes can be readily 
standard techniques known in the art for the 
Optimal design of such primer sequences can 
commercially available primer selection pro 
25 (See also, Nicklin M.H J., Weith A. Duff G 
the Human Interleukin-1 a, interleukin-1 p, 
Genomics 19: 382 (1995); Nothwang H.G., 
Cluster: Construction of an Integrated 
Region of Chromosome 2ql3" Genomics 41 
30 14:7897-7914 [published erratum appears in 
Database (GDB) project at the URL http://wjvw. 

For use in a kit, oligonucleotides 
synthetic compositions such as synthetic 
synthetic peptide nucleic acids (PNAs), and 



and 



et ; 



. YAC/PAC 
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onucleotides for use in the amplification and 
ijnethod of the invention is facilitated by the 

from human chromosome 2ql3 - which 
l[uman polymorphism information available for this 
IL-1 A, IL-1B and IL-1RN is shown in Figures 1 

Accession No. X04500) and 3 (GenBank 
primers for the detection of a human 
Resigned using this sequence information and 
design and optimization of primers sequences, 
be achieved, for example, by the use of 
rams such as Primer 2.1, Primer 3 or GeneFisher 
, "A Physical Map of the Region Encompassing 
Interleukin-1 Receptor Antagonist Genes" 
al. "Molecular Cloning of the Interleukin-1 gene 

Contig and a partial transcriptional Map in the 
370 (1997); Clark, et al. (1986) Nucl. Acids. Res., 
Nucleic Acids Res., 15:868 (1987) and the Genome 
.gdb.org). 

may be any of a variety of natural and/or 
oligonucleotides, restriction fragments, cDNAs, 

the like. The assay kit and method may also employ 
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Iabeled oligonucleotides to allow ease of identif ication in the assays. Examples of labels which 
may be employed include radio-labels, enzymes, fluorescent compounds, streptavidin, avidin, 
biotin, magnetic moieties, metal binding moieties, antigen or antibody moieties, and the like. 



The kit may, optionally, also inc: 
means are well known to one of skill in the art 



ude DNA sampling means. DNA sampling 
nd can include, but not be limited to substrates, 



such as filter papers, the AmpliCard™ (University of Sheffield, Sheffield, England S10 2JF; 
Tarlow, JW, et al. 9 J. of Invest Dermatol 103:2 87-389 (1994)) and the like; DNA purification 
reagents such as Nucleon™ kits, lysis buffers, proteinase solutions and the like; PCR reagents, 
such as 10X reaction buffers, thermostable polymerase, dNTPs, and the like; and allele detection 
10 means such as the Hinfi restriction enzyme, allele specific oligonucleotides, degenerate 
oligonucleotide primers for nested PCR from dried blood. 

4.2.3. Pharmacogenomics 

15 Knowledge of the particular alleles associated with a susceptibility to developing 

a particular disease or condition, alone or in conjunction with information on other genetic 
defects contributing to the particular disease oi condition allows a customization of the 
prevention or treatment in accordance with the individual's genetic profile, the goal of 
"pharmacogenomics 1 '. Thus, comparison of an individual's IL-1 profile to the population profile 

20 for a vascular disorder, permits the selection o:r design of drugs or other therapeutic regimens that 
are expected to be safe and efficacious for a particular patient or patient population (i.e., a group 
of patients having the same genetic alteration) . 



In addition, the ability to target 
benefit, based on genetic profile can enable: 1 



25 the rescue of drug candidates whose clinical development has been discontinued as a result of 
safety or efficacy limitations, which are patier.t subgroup-specific; and 3) an accelerated and less 



populations expected to show the highest clinical 
) the repositioning of already marketed drugs; 2) 



and more optimal drug labeling (e.g. since 
ent on the causative mutation is useful for 

with a particular therapeutic can be monitored by 
IRa), mRNA and/or transcriptional level. 



costly development for candidate therapeutics 
measuring the effect of various doses of an agi 
optimizing effective dose). 

The treatment of an individual 
determining protein (e.g. IL-1 a, IL-ip, or IL 
Depending on the level detected, the theraper tic regimen can then be maintained or adjusted 
(increased or decreased in dose). In a preferred embodiment, the effectiveness of treating a 



35 



subject with an agent comprises the steps of: 
subject prior to administration of the agent; ( 



(i) obtaining a preadministration satftple from a 

i) detecting the level or amount of a protein, mRNA 
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or genomic DNA in the preadministration sample; (iii) obtaining one or more post- 
administration samples from the subject; (iv) detecting the level of expression or activity of the 
protein, mRNA or genomic DNA in the post-administration sample; (v) comparing the level of 
expression or activity of the protein, mRNA or genomic DNA in the preadministration sample 
with the corresponding protein, mRNA or geno nic DNA in the postadministration sample, 
respectively; and (vi) altering the administration of the agent to the subject accordingly. 

Cells of a subject may also be obtained before and after administration of a 
therapeutic to detect the level of expression of genes other than an IL-1 gene to verify that the 
therapeutic does not increase or decrease the expression of genes which could be deleterious. 
This can be done, e.g., by using the method of transcriptional profiling. Thus, mRNA from cells 
exposed in vivo to a therapeutic and mRNA fern the same type of cells that were not exposed to 
the therapeutic could be reverse transcribed and hybridized to a chip containing DNA from 
numerous genes, to thereby compare the expression of genes in cells treated and not treated with 
the therapeutic. 

4.3 Therapeutics For Diseases and Conditions Associated with IL-1 
Polymorph isms 

Therapeutic for diseases or conditions associated with an IL-1 polymorphism or 
haplotype refers to any agent or therapeutic regimen (including pharmaceuticals, nutraceuticals 



and surgical means) that prevents or postpones 



the development of or alleviates the symptoms of 



the particular disease or condition in the subject. The therapeutic can be a polypeptide, 
peptidomimetic, nucleic acid or other inorganic or organic molecule, preferably a "small 
molecule" including vitamins, minerals and otier nutrients. Preferably the therapeutic can 
modulate at least one activity of an IL-1 polypeptide, e.g., interaction with a receptor, by 
mimicking or potentiating (agonizing) or inhibiting (antagonizing) the effects of a naturally- 
occurring polypeptide. An agonist can be a wild-type protein or derivative thereof having at 
least one bioactivity of the wild-type, e.g., receptor binding activity. An agonist can also be a 
compound that upregulates expression of a ge ie or which increases at least one bioactivity of a 
protein. An agonist can also be a compound which increases the interaction of a polypeptide 
with another molecule, e.g., a receptor. An an tagonist can be a compound which inhibits or 
decreases the interaction between a protein and another molecule, e.g., a receptor or an agent that 



blocks signal transduction or post- translation 



processing (e.g., IL-1 converting enzyme (ICE) 



inhibitor). Accordingly, a preferred antagonist is a compound which inhibits or decreases 
binding to a receptor and thereby blocks subsequent activation of the receptor. An antagonist 
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can also be a compound that downregulates expression of a gene or which reduces the amount of 
a protein present. The antagonist can be a dominant negative form of a polypeptide, e.g., a form 
of a polypeptide which is capable of interacting with a target peptide, e.g., a receptor, but which 
does not promote the activation of the receptor, 'fhe antagonist can also be a nucleic acid 
encoding a dominant negative form of a polypeptide, an antisense nucleic acid, or a ribozyme 



capable of interacting specifically with an RNA. 
to a polypeptide and inhibit its action. Such mo 



Yet other antagonists are molecules which bind 
ecules include peptides, e.g., forms of target 



peptides which do not have biological activity, and which inhibit binding to receptors. Thus, 
such peptides will bind to the active site of a protein and prevent it from interacting with target 
10 peptides. Yet other antagonists include antibodies that specifically interact with an epitope of a 
molecule, such that binding interferes with the biological function of the polypeptide. In yet 



another preferred embodiment, the antagonist is 
inhibiting the interaction between a polypeptide 



a small molecule, such as a molecule capable of 
and a target receptor. Alternatively, the small 



molecule can function as an antagonist by interacting with sites other than the receptor binding 
site. 

Modulators of IL-1 (e.g. IL-la, IL-lp or IL-1 receptor antagonist) or a protein 
encoded by a gene that is in linkage disequilibrium with an IL-1 gene can comprise any type of 
compound, including a protein, peptide, peptidomimetic, small molecule, or nucleic acid. 
Preferred agonists include nucleic acids (e.g. encoding an IL-1 protein or a gene that is up- or 

20 down-regulated by an IL-1 protein), proteins (e.g. IL-1 proteins or a protein that is up- or down- 
regulated thereby) or a small molecule (e.g. that regulates expression or binding of an IL-1 
protein). Preferred antagonists, which can be identified, for example, using the assays described 
herein, include nucleic acids (e.g. single (antiseuse) or double stranded (triplex) DNA or PNA 
and ribozymes), protein (e.g. antibodies) and small molecules that act to suppress or inhibit IL-1 

25 transcription and/or protein activity. 

4.3.1 . Effective Dose 

Toxicity and therapeutic efficadr of such compounds can be determined by 
standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
30 determining The LD50 (the dose lethal to 50% of the population) and the Edso (the dose 

therapeutically effective in 50% of the population). The dose ratio between toxic and therapeutic 
effects is the therapeutic index and it can be expressed as the ratio LD50/ED50. Compounds 
which exhibit large therapeutic indices are preferred. While compounds that exhibit toxic side 
effects may be used, care should be taken to design a delivery system that targets'such 
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compounds to the site of affected tissues in order! to minimize potential damage to uninfected 
cells and, thereby, reduce side effects. 

The data obtained from the cell ciilture assays and animal studies can be used in 



formulating a range of dosage for use in humans 



The dosage of such compounds lies preferably 



within a range of circulating concentrations that Include the ED50 with little or no toxicity. The 
dosage may vary within this range depending up an the dosage form employed and the route of 



administration utilized. For any compound used 



therapeutically effective dose can be estimated initially from cell culture assays. A dose may be 
formulated in animal models to achieve a circulating plasma concentration range that includes 
10 the IC50 (i.e., the concentration of the test compound which achieves a half-maximal inhibition 
of symptoms) as determined in cell culture. Such information can be used to more accurately 
determine useful doses in humans. Levels in plasma may be measured, for example, by high 
performance liquid chromatography. 



in the method of the invention, the 



4.3.2. Formulation and Use 

Compositions for use in accordance with the present invention may be formulated 
in a conventional manner using one or more physiologically acceptable carriers or excipients. 



Thus, the compounds and their physiologically 



20 or the nose) or oral, buccal, parenteral or rectal 



acceptable salts and solvates may be formulated 



for administration by, for example, injection, inhalation or insufflation (either through the mouth 



administration. 



For such therapy, the compound:; of the invention can be formulated for a variety 
of loads of administration, including systemic and topical or localized administration. 



Techniques and formulations generally may be 



found in Remmington's Pharmaceutical 



Sciences, Meade Publishing Co., Easton, PA. For systemic administration, injection is preferred, 
25 including intramuscular, intravenous, intraperitoneal, and subcutaneous. For injection, the 

compounds of the invention can be formulated in liquid solutions, preferably in physiologically 
compatible buffers such as Hank's solution or Ilinger's solution. In addition, the compounds 
may be formulated in solid form and redissolved or suspended immediately prior to use. 
Lyophilized forms are also included. 
30 For oral administration, the compositions may take the form of, for example, 

tablets or capsules prepared by conventional means with pharmaceutically acceptable excipients 
such as binding agents (e.g., pregelatinised maize starch, polyvinylpyrrolidone or hydroxypropyl 
methylcellulose); fillers (e.g., lactose, microcrystalline cellulose or calcium hydrogen 
phosphate); lubricants (e.g., magnesium stearate, talc or silica); disintegrants (e.g., potato starch 



20 



or sodium starch glycolate); or wetting agents (e.g., sodium lauryl sulfate). The tablets may be 
coated by methods well known in the art. Liquid preparations for oral administration may take 
the form of, for example, solutions, syrups or suspensions, or they may be presented as a dry 
product for constitution with water or other suitable vehicle before use. Such liquid preparations 



may be prepared by conventional means with p' 



larmaceutically acceptable additives such as 



suspending agents (e.g., sorbitol syrup, cellulose derivatives or hydrogenated edible fats); 
emulsifying agents (e.g., lecithin or acacia); nou-aqueous vehicles (e.g., ationd oil, oily esters, 
ethyl alcohol or fractionated vegetable oils); anil preservatives (e.g., methyl or propyl-p- 
hydroxybenzoates or sorbic acid). The preparations may also contain buffer salts, flavoring, 
10 coloring and sweetening agents as appropriate. 

Preparations for oral administration may be suitably formulated to give controlled 
release of the active compound. For buccal administration the compositions may take the form 
of tablets or lozenges formulated in conventional manner. For administration by inhalation, the 
compounds for use according to the present invention are conveniently delivered in the form of 
15 an aerosol spray presentation from pressurized packs or a nebuliser, with the use of a suitable 
propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, 
carbon dioxide or other suitable gas. In the case of a pressurized aerosol the dosage unit may be 
determined by providing a valve to deliver a metered amount. Capsules and cartridges of e.g., 

be formulated containing a powder mix of the 
lactose or starch. 

The compounds may be formullated for parenteral administration by injection, 
e.g., by bolus injection or continuous infusion. Formulations for injection may be presented in 
unit dosage form, e.g., in ampoules or in mul ;i-dose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or emulsions in oily or aqueous 
25 vehicles, and may contain formulating agents such as suspending, stabilizing and/or dispersing 
agents. Alternatively, the active ingredient may be in powder form for constitution with a 
suitable vehicle, e.g., sterile pyrogen-free water, before use. 

The compounds may also be ibrmulated in rectal compositions such as 
suppositories or retention enemas, e.g., containing conventional suppository bases such as cocoa 
30 butter or other glycerides. 

In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 
implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be ibrmulated with suitable polymeric or hydrophobic 



gelatin for use in an inhaler or insufflator may 
compound and a suitable powder base such as 
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materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as 
sparingly soluble derivatives, for example, as a sparingly soluble salt. Other suitable delivery 
systems include microspheres which offer the possibility of local noninvasive delivery of drugs 
over an extended period of time. This technology utilizes microspheres of precapillary size 
5 which can be injected via a coronary catheter into any selected part of the e.g. heart or other 
organs without causing inflammation or ischemia. The administered therapeutic is slowly 
released from these microspheres and taken up by surrounding tissue cells (e.g. endothelial 
cells). 

Systemic administration can also be by transmucosal or transdermal means. For 
10 transmucosal or transdermal administration, penetrants appropriate to the barrier to be permeated 

are used in the formulation. Such penetrants are generally known in the art, and include, for 

example, for transmucosal administration bile salts and fusidic acid derivatives. In addition, 

detergents may be used to facilitate permeation 

nasal sprays or using suppositories. For topical 
15 are formulated into ointments, salves, gels, or creams as generally known in the art. A wash 

solution can be used locally to treat an injury o : inflammation to accelerate healing. 

The compositions may, if desired, be presented in a pack or dispenser device 

which may contain one or more unit dosage forms containing the active ingredient. The pack 

may for example comprise metal or plastic foil, such as a blister pack. The pack or dispenser 
20 device may be accompanied by instructions for administration. 



Transmucosal administration may be through 
administration, the oligomers of the invention 



4.4 Assays to Identify Therapeutics 

Based on the identification of mutations that cause or contribute to the 
development of a disease or disorder that is associated with an IL-1 polymorphism or 

25 haplotype, the invention further features cell-based or cell free assays for identifying 
therapeutics. In one embodiment, a cell expressing an IL-1 receptor, or a receptor for a 
protein that is encoded by a gene which is in linkage disequilibrium with an IL-1 gene, on the 
outer surface of its cellular membrane is incu bated in the presence of a test compound alone or 
in the presence of a test compound and another protein and the interaction between the test 

30 compound and the receptor or between the p rotein (preferably a tagged protein) and the 
receptor is detected, e.g., by using a microphysiometer (McConnell et al. (1992) Science 
257:1906). An interaction between the receptor and either the test compound or ihe protein is 
detected by the microphysiometer as a change in the acidification of the medium. This assay 
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system thus provides a means of identifying molecular antagonists which, for example, 
function by interfering with protein- receptor interactions, as well as molecular agonist which, [ 
for example, function by activating a receptor. 

Cellular or cell-free assays can also be used to identify compounds which 
modulate expression of an IL-1 gene or a gene in linkage disequilibrium therewith, modulate 
translation of an mRNA, or which modulate the stability of an mRNA or protein. 



Accordingly, in one embodiment, a cell which^ 
protein is incubated with a test compound and 



is capable of producing an IL-1, or other 
the amount of protein produced in the cell 



medium is measured and compared to that produced from a cell which has not been contacted 
10 with the test compound. The specificity of the; compound vis a vis the protein can be 

confirmed by various control analysis, e.g., measuring the expression of one or more control 
genes. In particular, this assay can be used to determine the efficacy of antisense, ribozyme 
and triplex compounds. 

Cell-free assays can also be used to identify compounds which are capable of 



15 interacting with a protein, to thereby modify t 



le activity of the protein. Such a compound can, 



e.g., modify the structure of a protein thereby effecting its ability to bind to a receptor. In a 
preferred embodiment, cell-free assays for identifying such compounds consist essentially in a 
reaction mixture containing a protein and a test compound or a library of test compounds in the 
presence or absence of a binding partner. A test compound can be, e.g., a derivative of a 
20 binding partner, e.g., a biologically inactive target peptide, or a small molecule. 

Accordingly, one exemplary screening assay of the present invention includes 
the steps of contacting a protein or functional fragment thereof with a test compound or library 
of test compounds and detecting the formation of complexes. For detection purposes, the 
molecule can be labeled with a specific marker and the test compound or library of test 



25 compounds labeled with a different marker. 



fragment thereof can then be detected by determining the level of the two labels after an 
incubation step and a washing step. The presence of two labels after the washing step is 
indicative of an interaction. 

An interaction between molecules can also be identified by using real-time BIA 
30 (Biomolecular Interaction Analysis, Pharmacia Biosensor AB) which detects surface plasmon 
resonance (SPR), an optical phenomenon. Detection depends on changes in the mass 
concentration of macromolecules at the biospecific interface, and does not require any labeling 



nteraction of a test compound with a protein or 



15 
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of interactants. In one embodiment, a library o 
sensor surface, e.g., which forms one wall of a 



test compounds can be immobilized on a 
micro-flow cell. A solution containing the 



protein or functional fragment thereof is then flown continuously over the sensor surface. A 
change in the resonance angle as shown on a signal recording, indicates that an interaction has 
occurred. This technique is further described, e.g., in BIAtechnology Handbook by 
Pharmacia. 

Another exemplary screening assay of the present invention includes the steps of 
(a) forming a reaction mixture including: (i) an IL-1 or other protein, (ii) an appropriate receptor, 
and (iii) a test compound; and (b) detecting interaction of the protein and receptor. A statistically 



10 significant change (potentiation or inhibition) in 



presence of the test compound, relative to the interaction in the absence of the test compound, 



indicates a potential antagonist (inhibitor). The 
simultaneously. Alternatively, a protein can firi 



the interaction of the protein and receptor in the 



compounds of this assay can be contacted 
t be contacted with a test compound for an 



appropriate amount of time, following which the receptor is added to the reaction mixture. The 
efficacy of the compound can be assessed by generating dose response curves from data obtained 
using various concentrations of the test compound. Moreover, a control assay can also be 
performed to provide a baseline for comparison. 

Complex formation between a protein and receptor may be detected by a variety 
of techniques. Modulation of the formation of complexes can be quantitated using, for example, 

, fluorescently labeled, or enzymatically labeled 



20 detectably labeled proteins such as radiolabeled 

proteins or receptors, by immunoassay, or by cliromatographic detection. 



Typically, it will be desirable to 



immobilize either the protein or the receptor to 



facilitate separation of complexes from uncomplexed forms of one or both of the proteins, as 
well as to accommodate automation of the assay. Binding of protein and receptor can be 
25 accomplished in any vessel suitable for containing the reactants. Examples include microtitre 



plates, test tubes, and micro-centrifuge tubes. 



n one embodiment, a fusion protein can be 



provided which adds a domain that allows the protein to be bound to a matrix. For example, 
glutathione-S-transferase fusion proteins can bz adsorbed onto glutathione sepharose beads 
(Sigma Chemical, St. Louis, MO) or glutathione derivatized microtitre plates, which are then 
30 combined with the receptor, e.g. an 35s-labeled receptor, and the test compound, and the mixture 
incubated under conditions conducive to complex formation, e.g. at physiological conditions for 
salt and pH, though slightly more stringent conditions may be desired. Following, incubation, the 
beads are washed to remove any unbound label, and the matrix immobilized and radiolabel 
determined directly (e.g. beads placed in scint: llant), or in the supernatant after the complexes are 
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subsequently dissociated. Alternatively, the complexes can be dissociated from the matrix, 
separated by SDS-PAGE, and the level of protein or receptor found in the bead fraction 
quantitated from the gel using standard electropl.oretic techniques such as described in the 
appended examples. Other techniques for immobilizing proteins on matrices are also available 



5 for use in the subject assay. For instance, either 



protein or receptor can be immobilized utilizing 



conjugation of biotin and streptavidin. Transgenic animals can also be made to identify agonists 
and antagonists or to confirm the safety and efficacy of a candidate therapeutic. Transgenic 
animals of the invention can include non-human, animals containing a restenosis causative 
mutation under the control of an appropriate enc ogenous promoter or under the control of a 
1 0 heterologous promoter. 

The transgenic animals can also be animals containing a transgene, such as 
reporter gene, under the control of an appropriate promoter or fragment thereof. These animals 



are useful, e.g., for identifying drugs that modu' 



ate production of an IL-1 protein, such as by 



modulating gene expression. Methods for obtaining transgenic non-human animals are well 



15 known in the art. In preferred embodiments, th 



is restricted to specific subsets of cells, tissues or developmental stages utilizing, for example 



cis-acting sequences that control expression in 
mosaic expression of a protein can be essential 



expression of the restenosis causative mutation 



:he desired pattern. In the present invention, such 
for many forms of lineage analysis and can 



additionally provide a means to assess the effects of, for example, expression level which might 
grossly alter development in small patches of tissue within an otherwise normal embryo. 
Toward this end, tissue-specific regulatory sequences and conditional regulatory sequences can 
be used to control expression of the mutation in certain spatial patterns. Moreover, temporal 
patterns of expression can be provided by, for example, conditional recombination systems or 
prokaryotic transcriptional regulatory sequences. Genetic techniques, which allow for the 
25 expression of a mutation can be regulated via site-specific genetic manipulation in vivo, are 
known to those skilled in the art. 

The transgenic animals of the present invention all include within a plurality of 
their cells a causative mutation transgene of the present invention, which transgene alters the 
phenotype of the "host cell". In an illustrative embodiment, either the crelloxP recombinase 
system of bacteriophage PI (Lakso et al. (1992) PNAS 89:6232-6236; Orban et al. (1992) PNAS 
89:6861-6865) or the FLP recombinase systeri of Saccharomyces cerevisiae (O'Gorman et al. 
(1991) Science 251:1351-1355; PCT publication WO 92/15694) can be used to generate in vivo 
site-specific genetic recombination systems. Cre recombinase catalyzes the site-specific 
recombination of an intervening target sequence located between loxP sequences. loxP 
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sequences are 34 base pair nucleotide repeat sequences to which the Cre recombinase binds and 
are required for Cre recombinase mediated genetic recombination. The orientation of loxP 
sequences determines whether the intervening target sequence is excised or inverted when Cre 
recombinase is present (Abremski et al. (1984) J, Biol Chem. 259:1509-1514); catalyzing the 
excision of the target sequence when the loxP sequences are oriented as direct repeats and 
catalyzes inversion of the target sequence when loxP sequences are oriented as inverted repeats. 

Accordingly, genetic recombination of the target sequence is dependent on 
expression of the Cre recombinase. Expression of the recombinase can be regulated by promoter 



elements which are subject to regulatory contro 



causative mutation transgene can be regulated \ 



e.g., tissue-specific, developmental 



stage-specific, inducible or repressible by externally added agents. This regulated control will 
result in genetic recombination of the target sequence only in cells where recombinase 
expression is mediated by the promoter elemem;. Thus, the activation of expression of the 



ia control of recombinase expression. 



Use of the crelloxP recombinase system to regulate expression of a causative 

15 mutation transgene requires the construction of a transgenic animal containing transgenes 
encoding both the Cre recombinase and the subject protein. Animals containing both the Cre 
recombinase and the restenosis causative mutation transgene can be provided through the 
construction of "double" transgenic animals. A convenient method for providing such animals is 
to mate two transgenic animals each containing a transgene. 

20 Similar conditional transgenes c an be provided using prokaryotic promoter 

sequences which require prokaryotic proteins to be simultaneous expressed in order to facilitate 
expression of the transgene. Exemplary promoters and the corresponding trans-activating 
prokaryotic proteins are given in U.S. Patent No. 4,833,080. 

Moreover, expression of the conditional transgenes can be induced by gene 

25 therapy-like methods wherein a gene encoding the transactivating protein, e.g. a recombinase or 
a prokaryotic protein, is delivered to the tissue and caused to be expressed, such as in a cell-type 
specific manner. By this method, the transgene could remain silent into adulthood until "turned 
on" by the introduction of the transactivator. 

In an exemplary embodiment, ihe "transgenic non-human animals" of the 

30 invention are produced by introducing transgenes into the germline of the non-human animal. 
Embryonal target cells at various developmental stages can be used to introduce transgenes. 



Different methods are used depending on the 



stage of development of the embryonal target cell. 



The specific line(s) of any animal used to practice this invention are selected for general good 



health, good embryo yields, good pronuclear 



visibility in the embryo, and good reproductive 
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fitness. In addition, the haplotype is a significani 



factor. For example, when transgenic mice are 



to be produced, strains such as C57BL/6 or FVB lines are often used (Jackson Laboratory, Bar 
Harbor, ME). Preferred strains are those with Hj2 b , H-2 d or H-2<1 haplotypes such as C57BL/6 
or DBA/L The line(s) used to practice this invejition may themselves be transgenics, and/or may 
be knockouts (i.e., obtained from animals which have one or more genes partially or completely 
suppressed) . 

In one embodiment, the transgene construct is introduced into a single stage 
embryo. The zygote is the best target for micioinjection. In the mouse, the male pronucleus 
reaches the size of approximately 20 micrometers in diameter which allows reproducible 
injection of 1-2 pi of DNA solution. The use of zygotes as a target for gene transfer has a 
major advantage in that in most cases the injected DNA will be incorporated into the host gene 



before the first cleavage (Brinster et al. (1985) 
cells of the transgenic animal will carry the in 



PNAS 82:4438-4442). As a consequence, all 
xtorporated transgene. This will in general also be 



reflected in the efficient transmission of the transgene to offspring of the founder since 50% of 

15 the germ cells will harbor the transgene. 

Normally, fertilized embryos are incubated in suitable media until the pronuclei 
appear. At about this time, the nucleotide sequence comprising the transgene is introduced into 
the female or male pronucleus as described below. In some species such as mice, the male 
pronucleus is preferred. It is most preferred that the exogenous genetic material be added to the 

20 male DNA complement of the zygote prior to its being processed by the ovum nucleus or the 
zygote female pronucleus. It is thought that the ovum nucleus or female pronucleus release 
molecules which affect the male DNA complement, perhaps by replacing the protamines of the 
male DNA with histones, thereby facilitating t ie combination of the female and male DNA 
complements to form the diploid zygote. Thus, it is preferred that the exogenous genetic 

25 material be added to the male complement of DNA or any other complement of DNA prior to its 
being affected by the female pronucleus. For example, the exogenous genetic material is added 
to the early male pronucleus, as soon as possible after the formation of the male pronucleus, 



which is when the male and female pronuclei 



ire well separated and both are located close to the 



cell membrane. Alternatively, the exogenous genetic material could be added to the nucleus of 
30 the sperm after it has been induced to undergo decondensation. Sperm containing the exogenous 
genetic material can then be added to the ovum or the decondensed sperm could be added to the 
ovum with the transgene constructs being added as soon as possible thereafter. 
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Introduction of the transgene nuc 



eotide sequence into the embryo may be 



accomplished by any means known in the art such as, for example, microinjection, 
electroporation, or lipofection. Following introdiction of the transgene nucleotide sequence into 
the embryo, the embryo may be incubated in vitro for varying amounts of time, or reimplanted 
into the surrogate host, or both. In vitro incubation to maturity is within the scope of this 
invention. One common method in to incubate the embryos in vitro for about 1-7 days, 
depending on the species, and then reimplant thsm into the surrogate host. 

For the purposes of this invention a zygote is essentially the formation of a 
diploid cell which is capable of developing into a complete organism. Generally, the zygote will 
be comprised of an egg containing a nucleus foirmed, either naturally or artificially, by the fusion 
of two haploid nuclei from a gamete or gamete^;. Thus, the gamete nuclei must be ones which are 
naturally compatible, i.e., ones which result in a viable zygote capable of undergoing 
differentiation and developing into a functioning organism. Generally, a euploid zygote is 
preferred. If an aneuploid zygote is obtained, then the number of chromosomes should not vary 
by more than one with respect to the euploid number of the organism from which either gamete 



originated. 



In addition to similar biological 



considerations, physical ones also govern the 
amount (e.g., volume) of exogenous genetic material which can be added to the nucleus of the 
zygote or to the genetic material which forms <i part of the zygote nucleus. If no genetic material 
is removed, then the amount of exogenous genetic material which can be added is limited by the 
amount which will be absorbed without being physically disruptive. Generally, the volume of 
exogenous genetic material inserted will not exceed about 10 picoliters. The physical effects of 
addition must not be so great as to physically destroy the viability of the zygote. The biological 
limit of the number and variety of DNA sequences will vary depending upon the particular 
zygote and functions of the exogenous genetic material and will be readily apparent to one 

including the exogenous genetic material, of the 
of initiating and maintaining the differentiation and 
development of the zygote into a functional oirganism. 

The number of copies of the transgene constructs which are added to the zygote is 
dependent upon the total amount of exogenous genetic material added and will be the amount 
which enables the genetic transformation to occur. Theoretically only one copy is required; 
however, generally, numerous copies are utilized, for example, 1,000-20,000 copies of the 
transgene construct, in order to insure that one copy is functional. As regards the present 



skilled in the art, because the genetic material 
resulting zygote must be biologically capable 



invention, there will often be an advantage tc 



having more than one functioning copy of each of 
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the inserted exogenous DNA sequences to enhance the phenotypic expression of the exogenous 
DNA sequences. 

Any technique which allows for t tie addition of the exogenous genetic material 
into nucleic genetic material can be utilized so long as it is not destructive to the cell, nuclear 
membrane or other existing cellular or genetic structures. The exogenous genetic material is 
preferentially inserted into the nucleic genetic material by microinjection. Microinjection of cells 
and cellular structures is known and is used in the art. 

Reimplantation is accomplished using standard methods. Usually, the surrogate 
host is anesthetized, and the embryos are inserted into the oviduct. The number of embryos 
implanted into a particular host will vary by species, but will usually be comparable to the 
number of off spring the species naturally prodi ces. 



Transgenic offspring of the surrc 



and/or expression of the transgene by any suitable method. Screening is often accomplished by 



Southern blot or Northern blot analysis, using a 



gate host may be screened for the presence 



15 of the transgene. Western blot analysis using an. antibody against the protein encoded by the 



transgene may be employed as an alternative or 



probe that is complementary to at least a portion 



additional method for screening for the presence 



of the transgene product. Typically, DNA is prepared from tail tissue and analyzed by Southern 
analysis or PCR for the transgene. Alternatively, the tissues or cells believed to express the 
transgene at the highest levels are tested for the presence and expression of the transgene using 
Southern analysis or PCR, although any tissues or cell types may be used for this analysis. 

Alternative or additional methods for evaluating the presence of the transgene 
include, without limitation, suitable biochemical assays such as enzyme and/or immunological 
assays, histological stains for particular marker or enzyme activities, flow cytometric analysis, 
and the like. Analysis of the blood may also b s useful to detect the presence of the transgene 



25 product in the blood, as well as to evaluate the 



effect of the transgene on the levels of various 



types of blood cells and other blood constituents. 

Progeny of the transgenic anim lis may be obtained by mating the transgenic 
animal with a suitable partner, or by in vitro fe rtilization of eggs and/or sperm obtained from the 
transgenic animal. Where mating with a partner is to be performed, the partner may or may not 
be transgenic and/or a knockout; where it is transgenic, it may contain the same or a different 
transgene, or both. Alternatively, the partner may be a parental line. Where in vitro fertilization is 
used, the fertilized embryo may be implanted into a surrogate host or incubated in vitro, or both. 



Using either method, the progeny may be eval 



methods described above, or other appropriate; methods 



uated for the presence of the transgene using 
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The transgenic animals produced in accordance with the present invention will 
include exogenous genetic material. Further, in such embodiments the sequence will be attached 
to a transcriptional control element, e.g., a promoter, which preferably allows the expression of 
the transgene product in a specific type of cell. 
5 Retroviral infection can also be used to introduce the transgene into a non-human 

animal. The developing non-human embryo can be cultured in vitro to the blastocyst stage. 
During this time, the blastomeres can be targets for retroviral infection (Jaenich, R. (1976) PNAS 
73:1260-1264). Efficient infection of the blastomeres is obtained by enzymatic treatment to 
remove the zona pellucida (Manipulating the Mouse Embryo, Hogan eds. (Cold Spring Harbor 

10 Laboratory Press, Cold Spring Harbor, 1986). The viral vector system used to introduce the 
transgene is typically a replication-defective retrovirus carrying the transgene (Jahner et al. 
(1985) PNAS 82:6927-6931; Van der Putten et al. (1985) PNAS 82:6148-6152). Transfection is 
easily and efficiently obtained by culturing the blastomeres on a monolayer of virus-producing 
cells (Van der Putten, supra; Stewart et al. (1987) EMBO J. 6:383-388). Alternatively, infection 

15 can be performed at a later stage. Virus or virus-producing cells can be injected into the 

blastocoele (Jahner et al. (1982) Nature 298:623-628). Most of the founders will be mosaic for 
the transgene since incorporation occurs only in a subset of the cells which formed the transgenic 
non-human animal. Further, the founder may contain various retroviral insertions of the 
transgene at different positions in the genome which generally will segregate in the offspring. In 

20 addition, it is also possible to introduce transgenes into the germ line by intrauterine retroviral 
infection of the midgestation embryo (Jahner et al. (1982) supra). 

A third type of target cell for transgene introduction is the embryonal stem cell 
(ES). ES cells are obtained from pre-implantation embryos cultured in vitro and fused with 
embryos (Evans et al. (1981) Nature 292:154-156; Bradley et al (1984) Nature 309:255-258; 

25 Gossler et al. (1986) PNAS 83: 9065-9069; and Robertson et al, (1986) Nature 322:445-448). 
Transgenes can be efficiently introduced into the ES cells by DNA transfection or by retrovirus- 
mediated transduction. Such transformed ES cells can thereafter be combined with blastocysts 
from a non-human animal. The ES cells thereafter colonize the embryo and contribute to the 
germ line of the resulting chimeric animal. For review see Jaenisch, R. (1988) Science 240:1468- 

30 1474. 

The present invention is further illustrated by the following examples which 
should not be construed as limiting in any way. The contents of all cited references (including 
literature references, issued patents, published patent applications as cited throughout this 
application) are hereby expressly incorporated by reference. The practice of the present 
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invention will employ, unless otherwise indicated, conventional techniques that are within the 
skill of the art. Such techniques are explained fully in the literature. See, for example, 
Molecular Cloning A Laboratory Manual, (2nd ed., Sambrook, Fritsch and Maniatis, eds., Cold 
Spring Harbor Laboratory Press: 1989); DNA Cloning, Volumes I and II (IX N, Glover ed., 
5 1985); Oligonucleotide Synthesis (M. J. Gait ed., 1984); U.S. Patent No. 4,683,195; U.S. Patent 
No. 4,683,202; and Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds., 1984). 
5. EXAMPLES 
Example 1 Genotyping 

All human subjects were unrelated, Caucasian, healthy blood donors from 
10 Sheffield (n = 112). Subjects were typed at the loci indicated in Table 1. 



Table 2.Markers Used in Haplotype Study 



Marker 


Gene 


Reference 


2221223 


ILIA 


Todd& Naylor, Nucleic Acids Res. 19: 3756(1991) 


gz5/gz6 


ILIA 


Zuliani, et al, Am. J. Hum. Genet. 46: 963-69 (1990) 


-889 


ILIA 


McDowell, etal, Arth. & Rheum. 38: 221 -8(1995) 


+3954 


IL1B 


di Giovine, et al., Cytokine 7(6): 606 (1995) 


-511 


IL1B 


di Giovine, Hum. Molec. Genet. 1(6): 450 (1992) 


gaat.p33330 


between IL1B 
and IL1RN 


Murray, et al., Coop. Hum. Link. Center, unpublished 


Y31 


between IL1B 
and 1L1RN 


Spurr, etal, Cytogenet. & Cell Genet. 73: 255-73 (1996) 


VNTR 


IL1RN 


Tarlow, etal., Hum. Genet. 91: 403-4 (1993) 



The primer sequences and fluorescent labels used in PCR amplification of 
markers were as in Table 3. 

25 



Table 2.Primer Sequence and Flourescent Label for Genotyping 



Marker 


Label 


Primer Sequence 


2221223 


HEX 


ATGTATAGAATTCCATTCCTG (SEQ ID NO. 8) 




TAAAATCAAGTGTTGATGTAG (SEQ ID NO. 9) 


gz51gz6 


FAM 


GGGA7TACAGGCGTGAGCCACCGCG (SEQ ID NO. 10) 




TTAGTATTGCTGGTAGTATTCATAT (SEQ ID NO. 11) 


-889 


NONE 


TGTTCTACCACCTGAACTAGG (SEQ ID NO. 12) 
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Marker 


Label 


Primer Sequence 






TTACATATGAGCCTTCCATG (SEQ ID NO. 13) 


+3954 


NONE 


CTCAGGTGTCCTCGAAGAAATCAAA (SEQ ID NO. 14) 




GCTTTMGCTGTGAGTCCCG (SEQ ID NO. 15) 


-511 


NONE 


TGGCATTGATCTGGTTCATC (SEQ ID NO. 16) 






GTTTAGGAATCTTCCCACTT(SEO ID NO. 17) 


gaat.p33330 


rAM 


oAOCjCCjICjACjAAICICAACjA (orilj ID NU. loj 




GTGTCCTCAAGTGGATCTGG (SEQ 10 NO. 19) 


Y31 


HEX 


GGGCAACAGAGCAATGTTTCT (SEQ ID NO. 20) 




CAGTGTGTCAGTGTACTGTT (SEQ ID NO. 21) 


VNTR 


NONE 


CTCAGCAACACTCCTAT (SEQ ID NO. 22) 




TCCTGGTCTGCAGGTAA (SEQ ID NO. 23) 



Reaction conditions were as described in Table 4. 



10 

Table 4 .Reaction Conditions 



Marker 


Conditions 


222/223 


50 mM KCI, 10 mM Tris-HCI pH 9.0,1.5 mM MgCI 2 , 200:mM dNTPs, 25 
ng primers, 50 ng template, 0.004% W-l (Gibco-BRL) 0.2 u Taq, PCR was 
done at 30 cycles of 94°C for 1". 55 °C for 1", 72 °C for 1" 


gz5/gz6 


as per marker 2221223, except 1 u of Perfect Match (StrataGene) was added 


-889 


per marker 222/223, except PCR was done for 1 cycle at 96 °C for 1 ", 40 
cycles of 94°C for 1", 46°C for 1" 72°C for 1" and 1 cycle of 72°C for 4", 
products were cleaved with Ncol for analysis 


+3954 


as per marker 222/223, except PCR was done for 35 cycles with annealing at 
67.5 °C, products were cleaved with Taq 1 for analysis 


-511 


as per marker 2221223, except PCR was done for 1 cycle at 95 °C for 2", 35 
cycles of 95 °C for 1", 53 °C for 1" 74 °C for 1" and 1 cycle of 74 °C for 4", 
products were cleaved with Aval and Bsu361 for analysis 


gaat.p33330 


per marker 222/223 


Y31 


per marker 222/223 


VNTR 


per marker 222/223 except with 1.7 mM MgCI 2 , 1 cycle at 96 °C for 1"; 30 
cycles of 94°C for 1", 60°C for 1", 70°C for 1" and 1 cycle at 70°C for 2" 
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2221223, gz5/gz6, gaat.p33330 and Y31 PCR products were examined by agarose 
gel electrophoresis and the remainder of the PCR products were pooled according to the intensity 
of ethidium bromide staining. 2 gl of the pool was analyzed on an ABI 373 A automated 
sequencer and allele sizes were determined using the Genescan and Genotyper software. Alleles 
5 were globally binned using a simple computer program and numbered in order of size. 

-889 PCR products were digested with Ncol and the resulting fragments sized on 
8% PAGE. Allele 1 produces 83 and 16 bp fragments. Allele 2 produces a 99 bp fragment. 

+3954 PCR products were digested with restriction enzyme Taq I. Allele 1 
produces fragments of 97, 85 and 12 bp, and allele 2 produces fragments of 182 and 12 bp. 
10 -511 PCR products were digested with Aval and Bsu36I and the fragments were 

sized by 8% PAGE. Allele 1 produces 190 and 1 14 bp fragments when digested with Aval and a 
304 bp fragment when digested with Bsu36I. Allele 2 produces a 304 bp fragment when 
digested with Aval and 190 and 1 14 bp fragments when digested with Bsu36I. 

VNTR PCR products were sized by electrophoresis on 2% agarose gel at 90V for 
15 45 minutes. Allele 1 has 4 repeats and the PCR product is 412 bp, allele 2 has 2 repeats and the 
PCR product is 240 bp, allele 3 has 3 repeats and the PCR product is 326 bp, allele 4 has 4 
repeats and the PCR product is 498 bp, allele 5 has 6 repeats and the PCR product is 584 bp. 

Intergenic distances were determined by estimation based on the insert sizes of 
relevant PAC clones from a contig spanning the IL-1 gene cluster (Nicklin, et aL, Genomics 
20 1 9:382-4 (1994)), Intragenic distances were determined from the relevant nucleotide sequence 
obtained form the GENBANK database. 

Example 2 Method for Estimating Linkage Disequilibrium 

Because four of the markers studied herein are multiallelic, a preliminary analysis 

25 was carried out to determine which allelic combinations between pairs of loci contributed to the 
greatest disequilibrium, in order that the disequilibrium would not be masked when the alleles 
were grouped into biallelic systems. The E.H. program of Xie and Ott (Handbook of Human 
Genetic Link-age, 1994, John Hopkins University Press, 188-98), incorporated by reference 
herein, was used to estimate haplotype frequencies under H 0 (no linkage) and H, (allelic linkage 

30 allowed). It was found that the elaborate allele grouping, strategy had some advantages over 
commonly used methods, in that disequilibrium was detected between almost all*pairwise 
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combinations of markers examined and there was good correlation between disequilibrium and 
physical distance. 

More specifically, the E.H. program of Xie and Ott was used to determine 
maximum likelihood estimates of disequilibriumn (Dj) between each pairwise combination of 
5 alleles, where = h-j - pfl. are the frequencies for allele i at locus 1 and allele j at locus 2 
respectively, and hy is the frequency of the haplotype ij. The program calculated maximum 
likelihood values for the haplotype frequencies (and hence allele frequencies) under H 0 (no 
association) and haplotype frequencies under Hj (allelic association allowed). For markers with 
greater than two alleles, the E. H. estimate for allele frequencies correlated poorly with the 

10 allele frequencies as estimated directly from the sample population, and therefore gave no 
confidence to the Dy estimates given. It was therefore necessary to group alleles of the 
multi-allelic markers into a biallelic system. Analysis of the markers in a biallelic format has 
the added advantages that the notation D ijs p p and q } can be simplified to D, p, and q 
respectively, where p and q are defined to be the frequencies of the rarer alleles at both loci 

15 (such that without loss of generality p < q < 0.5), and D is the estimated disequilibrium 
between those alleles. 

Under a biallelic system, power is also much simpler to determine using equations 
as detailed by Hill (Hill, Heredity, 33: 229-39 (1974)). In addition, the sign of D becomes 

A A 

informative, such thatD > 0 when the rarer alleles at each of the two loci are associated, andD < 
20 0 when the rare allele at one locus is associated with the common allele at the other locus. 

Because the method of allele grouping clearly affected the power to detect 
disequilibrium (Zouros, et al, Genet. 85: 543-50 (1977); Weir, et al t Genet 88: 633-42 (1976)), 
a preliminary analysis was conducted to ensure that the grouping did not mask disequilibrium 
between subsets of alleles. In this analysis, 6- = (O^ - E^Af was calculated for each 
25 haplotype, where E y is the expected number of haplotypes ij assuming equilibrium (E 0 - = 2n p^, 
where n = number of individuals in the study), and O'y is a basic estimate for the observed 
haplotype count determined as follows. All genotypes that could be unambiguously resolved 
were haplotype counted. Each double heterozygote (iji/j j 2 ) could be resolved into two possible 
haplotype sets, [i-J^i^] or [i-J 2 >y - j. Using the haplotype frequencies as estimated from the 
30 unambiguous haplotype count, the probability of each set was calculated and used as a "partial" 
count. In this way the ambiguous genotypes were also haplotype counted, and the total counts 
(ambiguous plus unambiguous) constituted the Oys used in 5^. Once established,* the magnitude 
and sign of the 5~s were used to determine which allelic combinations showed greatest deviation 
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from the null hypothesis of no association. This information was used to group alleles at the 
multiallelic loci into biallelic systems to enable efficient use of the E.H. program. 

In order to compare the degree of disequilibrium between different pairwise 
combinations of loci, a frequency independent measure of disequilibrium D, the proportion of 
5 maximum possible disequilibrium in the given direction) was calculated, where D — D/|D max | 

(Thompson, et aL, Am. J. Hum. Genet. 42: 1 13-24 (1988)). The relationship between p and q are 

A 

such that p < q <, 0.5, and it can therefore be written that -pq < D < p(l-q) such that whenD< 0, 

A 

D max = -pq and when D>0, D max = p(l-q). Output from the E.H. program included log-likelihoods 
for the maximum likelihood parameter values under H 0 and H 1? and since -21n (Lq/LJ-X 2 . 
10 where L Q and L { are the likelihoods under H 0 and H l5 p- values could then be determined for 
each test. 

A 

The asymptotic variance for D, under H Q : D = 0 and H x were computed using the 
formula as defined by Hill (Heredity 33: 229-39 (1974)) for genotypic data. Using these, the 
power for each pairwise comparison could be calculated. 

15 Common haplotypes containing all 8 loci were identified from the preliminary 

analysis of 6- described above, and backed up by the magnitude and sign of the disequilibria 
once the alleles at the multiallelic loci had been grouped. For these loci, the allele in the group 
which contributed most to the disequilibrium has been identified on the haplotype. To estimate 
the population haplotype frequencies, rates of carriage of at least one copy of the relevant alleles 

20 in the population were determined. These do not represent true haplotypes since phase is 

unknown. Monte Carlo simulation techniques were used to test for significant deviation from a 
simulated null distribution for these combined carriages under the assumption of no association. 



Example 3 Estimation of Linkage Disequilibrium in the IL-1 Gene Cluster 
25 A number of biallelic and multiallelic markers in and around the IL-1 genes have 

been identified. However, the extent of linkage disequilibrium between the markers, and the 
prevalence of multimarker haplotypes in the general population have not until now been 
identified. 

Figure 1 shows the relative positions of the 8 marker loci used in this study. DNA 
30 samples from 212 unrelated healthy volunteers were geno typed for each of these markers, and 
the resulting estimates of allele frequencies are shown in Table 5. 



Table 5. 
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Estimated frequencies of marker alleles 
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r: 

ireq. 


gz5/gz6 


ireq. 
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-8o9 


freq. 
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ireq. 
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0.005 


1(79 bp) 


0.003 


l(Ncol) 
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0.812 
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0.018 


2 ( 82 bo"! 


0.005 
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0.286 


2 


0.188 
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0.378 
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0.676 












0.299 
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5 (134 bp) 


0.016 
















A OAS? 














7 f 138 dd> 
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Y31 


freq. 


VNTR 


frcq. 


i 


0.618 


1(189 bo 1 * 


0.658 


1 H48 hn) 


0.092 


1 


0.744 


2 (Rsu361> 


0.382 


2 n 03 bn* 


0.002 


2 (158 bx>) 


0.008 


2 


0.256 






3(197 brA 


0.255 


3 (160 bol 


0.454 










4 (201 bo} 

r \ JL,\J 1 L/ U> / 


0.084 


4 (162 bo> 


0.062 














5 f 164 br^ 


0.003 














6 (166 bp) 


0.122 














7(168 bo) 


0.035 














8 (170 bp) 


0.030 














9 (172 bp) 


0.095 














10(174 bp) 


0.087 














11 (176 bp) 


0.003 














12(178 bp) 


0.011 








398 




404 




370 




398 



30 ^number of chromosomes analyzed 

Note - Allele names (and sizes) are given in boldface. 



To determine the linkage disequilibria between pairwise combinations of loci, the 
computer program of Xie and Ott was used. This program was found to be most efficient when 
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used with biallelic systems, therefore alleles at the multiallelic loci were grouped in the most 
appropriate way for each pairwise comparison, such that disequilibrium between subsets of 
alleles was not masked. 

In Table 6, the disequilibria between pairs of loci are expressed as D, the ratio of 

A 

5 D to its maximum value D max and are shown together with the approximate physical distances 
between the loci in kilobase pairs. 

t 

~ A 

Table 6 .Disequilibrium (D = D/|D max | ) and physical distances between markers 





222/223 


gz5/gz6 


-889 


+3953 


222/223 




+0.872 


+0.829 


+0.710 


gz5/gz6 


2.5 




-0.889 


-0.695 


-889 


7 


4.5 




+0.804 


+3953 


55 


55 


50 




-511 


60 


60 


55 


4.5 


gaat.p33330 


260 


260 


255 


205 


Y31 


310 


310 


305 


255 


VNTR 


380 


380 


375 


325 




-511 


^aat.p33330 


Y31 


VNTR 


222/223 


+0.535 


+0433 


+0.364 


-0.499 


gz5/gz6 


+0.540 


+0.517 


-0.503 


+0.286 


-889 


-0.264 


+0.337 


+0.318 


-0.207 


+3954 


-0.617 


+0.409 


-0.475 


0.439 


-511 




+0.691 


-0.456 


+0.448 


gaat.p33330 


200 




+0.639 


+0.442 


Y31 


250 


50 




-0.765 


VNTR 


320 


120 


70 





Note - disequilibrium values are shown at the top right, approximate physical distances in Kb are shown at the 
30 bottom left. Intergenic distances are given to the nearest 5 Kb. 

Table 7 shows the power to detect 50% D max for each locus combination, and the 
p values for each corresponding D. 
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Table 7.Power to detect 50% D TOax and p values of -2Ln (L 0 L t ) 





222/223 


gz5/gz6 


-889 


+3953 


22/223 


- 


-100(+) 


-100(+) 


98(+) 


gz5/gz6 


<lxl0- 10 


- 


87(-) 


60(-) 


-889 


<lxlO !0 


<3xl0" 8 


- 


-100(+) 


+3954 


-lxlO" 7 


*-9xl0" 3 


<lxl0 10 


- 


-511 


-9x1 cr 10 


-4xl0 10 


*-9.4xl0" 2 


*-2.6xl0" 2 


gaat.p3330 


-9x1 0' 8 


-2xl0" 9 


*-1.7xl0" 2 


-5x10^ 


Y31 


-lxlO" 4 


-4x1 0 A 


-6x1 0" 4 


-lxlO" 7 


VNTR 


-lxl 0" 3 


-lxlO" 3 


*-3xl0-' 


*-1.2xl0 1 




-511 


gaat.p33330 


Y31 


VNTR 


22/223 


~100(+) 


~100(+) 


~100(+) 


93(-) 


gz5/gz6 


~100(+) 


~100(+) 


98(-) 


~100(+) 


-889 


96(-) 


89(+) 


~100(+) 


78(-) 


+3954 


79(-) 


97(+) 


~100(+) 


52(-) 


-511 




~100(+) 


~100(-) 


~100(+) 


gaat.p33330 


<lxlO" 10 




49(+) 


~100(+) 


Y31 


-2X10" 4 


*~7xlO' 3 




89(-) 


VNTR 


~8xl0" 6 


~lxl(T 9 


2xl0' 7 





20 Note - Power is shown at the top right with the sign of disequilibrium in brackets; pointwise p-values 
are shown (uncorrected) at the bottom left. For an overall significance level of p = 0.05, pointwise 
significance level is 0.0018 for 28 comparisons. 
* Not significant at p = 0.0018 threshhold 

25 Significant linkage disequilibrium (p corr < 0.05) was detected between most 

combinations of loci, with only a few exceptions. These include the comparisons between the 
VNTR and the more distant biallelic markers, +3954, and -889, in which the disequilibrium is in 
the negative direction and consequently the power is reduced (Table 7). The correlation between 
disequilibrium D and physical distance was r= -0.752 (p < 0.0001, one tailed) (Figure,2). 

30 

In order to compare different grouping methods for the multiallelic markers, D 
was calculated for all the comparisons involving 222/223 using two additional grouping 
strategies. The first of these was a "common allele versus the rest" approach, and the second was 
a grouping based on allele size, using the bimodal distribution of allele frequency versus size 
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which was observed for all the multiallelic markers examined. The results of this analysis are 
shown in Table 8, where D values for the three grouping methods are compared. 

Table 8.D values for three methods of grouping alleles at the multiallelic marker loci 



5 







common vs. rest 


allele size 


gz5/gz6 


0.87 


0.79 


0.77 


-899 


0.83 


0.81 


0.98 


+3954 


0.71 


*0.74 


0.77 


-511 


0.54 


*0.15 


0.61 


gaat.p33330 


0.43 


*0.03 


0.53 


Y31 


0.36 


*0.12 


0.16 


VNTR 


0.5 


0.48 


*0.04 



Note- Values re given for the disequilibrium between 222/223 and the other markers listed. 
15 * indicates not significant at p = 0.05 level, even before correction for multiple testing. 

It can be seen that the disequilibrium is not detected in several instances using 
these other grouping strategies, notably 222/223 with -511 and gaat.p33330 in the common 
versus rest approach, 222/223 with Y31 in both the common versus rest and allele size 

20 approaches, and 222/223 with VNTR in the allele size approach. 

Examination of which alleles of the multiallelic loci were contributing greatest to 
the disequilibrium, from the determination of Sy revealed the existence of 2 haplotypes 
containing alleles of all 8 loci. These were confirmed by examination of the haplotype 
frequencies and disequilibrium values obtained after the grouping. The first haplotype: alleles 

25 441 12332 (expressed in chromosome order, see Fig. 1) is the most common (carriage of 34/198), 
and is present 7 times more frequently than expected (expected = 4.5/198) (p < 0.000001). The 
second haplotype: alleles 33221461 (carriage of 2/206) was present 4 times more frequently than 
expected (expected 0.5/206), but this was not statistically significant (p ~ 0.106). However, 
examination of a larger sample size might assist in increasing the statistical significance of this 

30 finding. 

The data presented indicate a significant degree of linkage disequilibrium across 
an approximately 400 Kb stretch of chromosome 2ql3. The disequilibrium was strong both for 
the three markers within the IL-lo gene, as might be expected, but was also strong between some 
of the more distantly separated markers (-899/+3954;D = +0.804, physical distance = 50 Kb) 
35 (Table 6). However, D was considerably diminished between the extreme ends of the cluster. 
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Within the IL-ip gene, a moderate value of D (+3954/-51 1; D = -0.617) was obtained, although 
this was not significant when corrected for multiple comparisons, probably reflecting the 
reduction in power when disequilibrium is in the negative direction (Thompson, et al, Am J 
Hum. Genet 42: 773-24(1988)). 
5 Overall, there is a good correlation between physical distance and linkage 

disequilibrium (Figure 2); r = -0.752. The reliability of r itself depends partly on the reliability of 
the estimates of both physical distance and D. Over the short distances, the physical distances 
are accurate since they are determined from known DNA sequence, whereas the longer range 
estimates are less precise. The power can be taken tentatively as one indicator of the reliability of 

10 D, since if power is low this indicates that the sample size was too small, and with low sample 
sizes the estimates for D may be unreliable. 

The success of the elaborate grouping strategy is indicated by Table 8, which 
shows several instances where disequilibrium between particular loci is apparently low or not 
detected when other commonly used grouping methods are employed. The disadvantages of the 

15 grouping strategy used here are that it is rather laborious since the information used for the 
grouping was based on an approximate estimate of the "observed" haplotype frequencies (see 
Example 2). For the more polymorphic markers the higher heterozygosity meant that the 
estimate of 5^, was less precise since there was a higher proportion of ambiguous haplotypes. 
Notwithstanding this drawback, care was taken to take into account both the sign and magnitude 

20 of 6 lj? and the frequencies of the alleles concerned. 

The method could be simplified, in a sufficiently large study, by just considering 
the unambiguous haplotypes when determining the grouping. The determination of 6^ uses the 
maximum amount of prior knowledge for the grouping of the multiallelic markers, and this may 
be the reason why disequilibrium between almost all pairwise combinations of markers was 

25 detected. 

The two haplotypes containing all 8 markers, as well as other shorter haplotypes, 

are of particular interest since it is likely that particular combinations of alleles of the IL- 1 genes 

may act in concert to determine an overall inflammatory phenotype. An understanding of which 

markers are in strong linkage disequilibrium not only allows for more rational design of genetic 

30 studies but also may provide clues to disease mechanism. Therefore, in addition to the alleles 

identified herein, the IL-1 (441 12332) haplotype may contain the following alleles: 

allele 2 of the 1731 marker of the IL1RN gene(A at position 173 1); 
allele 2 of the 1812 marker of the IL1RN gene (A at position 18 12-); 
allele 2 of the 1868 marker of the IL1RN gene(G at position 1868); 
35 allele 2 of the 1887 marker of the IL1RN gene(C at position 1887); 
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allele 2 of the 8006 marker of the IL1RN gene (contains a Hpall or Mspl site) 
allele 2 of the 8061 marker of the IL1RN gene (lacks a Mwol site) 
allele 2 of the 9589 marker of the IL1RN gene (contains an Sspl site) 

5 Furthermore, the following PCR primers may be used to amplify these alleles; 

TTACGCAGATAAGAACCAGTTTGG (SEQ ID NO. 24) 
TTTCCTGGACGCTTGCTCACCA (SEQ ID NO. 25) 
(used for 1731, 1812, 1868, and 1887) 

10 

TTCTATCTGAGGAACAACCAACTAGTAGC (SEQ ID NO. 26) 
CACCAGACTTGACACAGGACAGGCACATC (SEQ ID NO. 27) 
(used for 8006) 

15 CGACCCTCTGGGAGAAAATCCAGCAAG (SEQ ID NO. 28) 

(used with SEQ ID NO. 20 for 8006) 

ACACAGGAAGGTGCCAAGCA (SEQ ID NO. 29) 
TGCAGACAGACGGGCAAAGT (SEQ ID NO. 30) 
20 (used for 8006 and 9589) 

TTGTGGGGACCAGGGGAGAT (SEQ ID NO. 31), and 
AGCCTGGCACTCTGCTGAAT (SEQ ID NO. 32) 
(used for 9589). 

25 

Example 4 The IL-1 (441 123-32) Haplotype is Associated with Diabetic Nephropathy 

The presence of the two haplotypes described herein was investigated in healthy 
and diseased populations to determine if the haplotypes were associated with inflammatory 
disease. 81 non-insulin dependant diabetes mellitus (NIDDM) patients with nephropathy were 

30 compared with 198 ethnically matched healthy subjects in example 3 and 147 NIDDM patients 
without nephropathy. Genotyping was carried out as in example 1 . 

The IL-1 (441 12332) haplotype was carried by 24 of 79 of the NIDDM 
nephropathy patients and 25 of 141 NIDDM without nephropathy patients. However, the second 
haplotype (3322146 1) was not found in the nephropathy patients (0/8 1). The IL-1 (441 12332) 

35 haplotype was significantly over represented in the patient group compared with the healthy 
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control group (24/79 vs. 34/198; p = 0.015) and the NIDDM without nephropathy group (24/79 
vs. 25/141; p = 0.03). 

Example 5 An IL-1 Haplotype is Associated with Inflammatory Disease 
5 This is a prophetic example. Other diseases are examined as per Example 4. The 

IL-1 (441 12332) haplotype is found to be associated with coronary artery disease, osteoporosis, 
nephropathy in diabetes mellitus, alopecia areata, Graves disease, systemic lupus erythematosus, 
lichen sclerosis and ulcerative colitis. 

Likewise, the IL-1 (33221461) haplotype is associated with periodontal disease, 
10 juvenile chronic arthritis, psoriasis, insulin dependant diabetes and diabetic retinopathy. 

Example 6 Novel Markers are Linked to an IL-1 Haplotype 

This is a prophetic example. Additional markers are identified by sequence and 
restriction enzyme analysis of the 2ql3-14 region. These new markers are identified as belonging 
15 to an IL-1 haplotype in the manner described in Examples 2 and 3. 

Example 7 The IL-1 ( 441 12332) Haplotype Is Used to Predict Disease Susceptibility 

This is a prophetic example. A patient with a family history of ulcerative colitis is 

genotyped for the presence of the IL-1 (441 12332) haplotype. Genotyping is performed as in 
20 Example I and the patient is determined to carry one or more alleles of the haplotype. The patient 

is thus treated with IL-1 antagonists to prevent disease. 

A second patient with a family history of coronary artery disease is genotyped at 

the IL-1 gene cluster. The patient is found to carry one or more alleles; of the IL-1 (441 12332) 

haplotype and be homozygous for the VNTR allele 2. Thus, the patient is 5.4 times as likely to 
25 develop coronary artery disease as the general population and is treated vigorously to prevent 

disease. 

Example 8 Additional Haplotypes are Statistically Significant 

This is a prophetic example. An additional 400 chromosomes are typed as per 
30 Example 1 and linkage disequilibrium assessed as per Example 2. The IL-1 (33221461) 
haplotype is found to be present about 4 times more frequently than expected (p - 0,05). 

In a similar manner, the following markers are determined to be present in the IL- 
1 (441 12332) haplotype (p « 0.05). 

allele 2 of the 1731 marker of the IL1RN gene 
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allele 2 of the 1812 marker of the IL1RN gene 
allele 2 of the 1868 marker of the IL1RN gene 
allele 2 of the 1887 marker of the IL1RN gene 
allele 2 of the 8006 marker of the IL1RN gene 
allele 2 of the 8061 marker of the IL1RN gene 
allele 2 of the 9589 marker of the IL1RN gene 



